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Preface 


This book is the fruit of for many years teaching the introduction to quan¬ 
tum mechanics to second-year students of physics at Oxford University. We 
have tried to convey to students that it is the use of probability amplitudes 
rather than probabilities that makes quantum mechanics the extraordinary 
thing that it is, and to grasp that the theory’s mathematical structure follows 
almost inevitably from the concept of a probability amplitude. We have also 
tried to explain how classical mechanics emerges from quantum mechanics. 
Classical mechanics is about movement and change, while the strong empha¬ 
sis on stationary states in traditional quantum courses makes the quantum 
world seem static and irreconcilably different from the world of every-day 
experience and intuition. By stressing that stationary states are merely the 
tool we use to solve the time-dependent Schrodinger equation, and presenting 
plenty of examples of how interference between stationary states gives rise 
to familiar dynamics, we have tried to pull the quantum and classical worlds 
into alignment, and to help students to extend their physical intuition into 
the quantum domain. 

Traditional courses use only the position representation. If you step 
back from the position representation, it becomes easier to explain that the 
familiar operators have a dual role: on the one hand they are repositories of 
information about the physical characteristics of the associated observable, 
and on the other hand they are the generators of the fundamental symmetries 
of space and time. These symmetries are crucial for, as we show already in 
Chapter 4, they dictate the canonical commutation relations, from which 
much follows. 

Another advantage of down-playing the position representation is that it 
becomes more natural to solve eigenvalue problems by operator methods than 
by invoking Frobenius’ method for solving differential equations in series. A 
careful presentation of Frobenius’ method is both time-consuming and rather 
dull. The job is routinely bodged to the extent that it is only demonstrated 
that in certain circumstances a series solution can be found, whereas in 
quantum mechanics we need assurance that all solutions can be found by this 
method, which is a priori implausible. We solve all the eigenvalue problems 
we encounter by rigorous operator methods and dispense with solution in 
series. 

By introducing the angular momentum operators outside the position 
representation, we give them an existence independent of the orbital angular- 
momentum operators, and thus reduce the mystery that often surrounds 
spin. We have tried hard to be clear and rigorous in our discussions of the 
connection between a body’s spin and its orientation, and the implications of 
spin for exchange symmetry. We treat hydrogen in fair detail, helium at the 
level of gross structure only, and restrict our treatment of other atoms to an 
explanation of how quantum mechanics explains the main trends of atomic 
properties as one proceeds down the periodic table. Many-electron atoms 
are extremely complex systems that cannot be treated in a first course with 
a level of rigour with which we are comfortable. 

Scattering theory is of enormous practical importance and raises some 
tricky conceptual questions. Chapter 5 on motion in one-dimensional step 
potentials introduces many of the key concepts, such as the connection be¬ 
tween phase shifts and the scattering cross section and how and why in 
resonant scattering sensitive dependence of phases shifts on energy gives rise 
to sharp peaks in the scattering cross section. In Chapter 12 we discuss fully 
three-dimensional scattering in terms of the S-matrix and partial waves. 

In most branches of physics it is impossible in a first course to bring 
students to the frontier of human understanding. We are fortunate in be¬ 
ing able to do this already in Chapter 6, which introduces entanglement and 
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quantum computing, and closes with a discussion of the still unresolved prob¬ 
lem of measurement. Chapter 6 also demonstrates that thermodynamics is 
a straightforward consequence of quantum mechanics and that we no longer 
need to derive the laws of thermodynamics through the traditional, rather 
subtle, arguments about heat engines. 

We assume familiarity with complex numbers, including de Moivre’s 
theorem, and familiarity with first-order linear ordinary differential equa¬ 
tions. We assume basic familiarity with vector calculus and matrix algebra. 
We introduce the theory of abstract linear algebra to the level we require 
from scratch. Appendices contain compact introductions to tensor notation, 
Fourier series and transforms, and Lorentz covariance. 

Every chapter concludes with an extensive list of problems for which 
solutions are available. The solutions to problems marked with an asterisk, 
which tend to be the harder problems, are available online 1 and solutions to 
other problems are available to colleagues who are teaching a course from the 
book. In nearly every problem a student will either prove a useful result or 
deepen his/her understanding of quantum mechanics and what it says about 
the material world. Even after successfully solving a problem we suspect 
students will find it instructive and thought-provoking to study the solution 
posted on the web. 

We are grateful to several colleagues for comments on the first two edi¬ 
tions, particularly Justin Wark for alerting us to the problem with the singlet- 
triplet splitting. Fabian Essler, Andre Lukas, John March-Russell and Laszlo 
Solymar made several constructive suggestions. We thank Artur Ekert for 
stimulating discussions of material covered in Chapter 6 and for reading that 
chapter in draft form. 

June 2012 James Binney 

David Skinner 


1 http://www-thphys.physics.ox.ac.uk/people/JamesBinney/QBhome.htm 
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Probability and probability 
amplitudes 


The future is always uncertain. Will it rain tomorrow? Will Pretty Lady win 
the 4.20 race at Sandown Park on Tuesday? Will the Financial Times All 
Shares index rise by more than 50 points in the next two months? Nobody 
knows the answers to such questions, but in each case we may have infor¬ 
mation that makes a positive answer more or less appropriate: if we are in 
the Great Australian Desert and it’s winter, it is exceedingly unlikely to rain 
tomorrow, but if we are in Delhi in the middle of the monsoon, it will almost 
certainly rain. If Pretty Lady is getting on in years and hasn’t won a race yet, 
she’s unlikely to win on Tuesday either, while if she recently won a couple of 
major races and she’s looking fit, she may well win at Sandown Park. The 
performance of the All Shares index is hard to predict, but factors affecting 
company profitability and the direction interest rates will move, will make 
the index more or less likely to rise. Probability is a concept which enables 
us to quantify and manipulate uncertainties. We assign a probability p = 0 
to an event if we think it is simply impossible, and we assign p — 1 if we 
think the event is certain to happen. Intermediate values for p imply that 
we think an event may happen and may not, the value of p increasing with 
our confidence that it will happen. 

Physics is about predicting the future. Will this ladder slip when I 
step on it? How many times will this pendulum swing to and fro in an 
hour? What temperature will the water in this thermos be at when it has 
completely melted this ice cube? Physics often enables us to answer such 
questions with a satisfying degree of certainty: the ladder will not slip pro¬ 
vided it is inclined at less than 23.34° to the vertical; the pendulum makes 
3602 oscillations per hour; the water will reach 6.43°C. But if we are pressed 
for sufficient accuracy we must admit to uncertainty and resort to probability 
because our predictions depend on the data we have, and these are always 
subject to measuring error, and idealisations: the ladder’s critical angle de¬ 
pends on the coefficients of friction at the two ends of the ladder, and these 
cannot be precisely given because both the wall and the floor are slightly 
irregular surfaces; the period of the pendulum depends slightly on the am¬ 
plitude of its swing, which will vary with temperature and the humidity of 
the air; the final temperature of the water will vary with the amount of heat 
transferred through the walls of the thermos and the speed of evaporation 
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from the water’s surface, which depends on draughts in the room as well as 
on humidity. If we are asked to make predictions about a ladder that is in¬ 
clined near its critical angle, or we need to know a quantity like the period of 
the pendulum to high accuracy, we cannot make definite statements, we can 
only say something like the probability of the ladder slipping is 0.8, or there 
is a probability of 0.5 that the period of the pendulum lies between 1.0007 s 
and 1.0004 s. We can dispense with probability when slightly vague answers 
are permissible, such as that the period is 1.00 s to three significant figures. 
The concept of probability enables us to push our science to its limits, and 
make the most precise and reliable statements possible. 

Probability enters physics in two ways: through uncertain data and 
through the system being subject to random influences. In the first case we 
could make a more accurate prediction if a property of the system, such as the 
length or temperature of the pendulum, were more precisely characterised. 
That is, the value of some number is well defined, it’s just that we don’t 
know the value very accurately. The second case is that in which our system 
is subject to inherently random influences - for example, to the draughts 
that make us uncertain what will be the final temperature of the water. 
To attain greater certainty when the system under study is subject to such 
random influences, we can either take steps to increase the isolation of our 
system - for example by putting a lid on the thermos - or we can expand the 
system under study so that the formerly random influences become calculable 
interactions between one part of the system and another. Such expansion 
of the system is not a practical proposition in the case of the thermos - the 
expanded system would have to encompass the air in the room, and then 
we would worry about fluctuations in the intensity of sunlight through the 
window, draughts under the door and much else. The strategy does work 
in other cases, however. For example, climate changes over the last ten 
million years can be studied as the response of a complex dynamical system 
- the atmosphere coupled to the oceans - that is subject to random external 
stimuli, but a more complete account of climate changes can be made when 
the dynamical system is expanded to include the Sun and Moon because 
climate is strongly affected by the inclination of the Earth’s spin axis to the 
plane of the Earth’s orbit and the Sun’s coronal activity. 

A low-mass system is less likely to be well isolated from its surroundings 
than a massive one. For example, the orbit of the Earth is scarcely affected 
by radiation pressure that sunlight exerts on it, while dust grains less than a 
few microns in size that are in orbit about the Sun lose angular momentum 
through radiation pressure at a rate that causes them to spiral in from near 
the Earth to the Sun within a few millennia. Similarly, a rubber duck left 
in the bath after the children have got out will stay very still, while tiny 
pollen grains in the water near it execute Brownian motion that carries 
them along a jerky path many times their own length each minute. Given 
the difficulty of isolating low-mass systems, and the tremendous obstacles 
that have to be surmounted if we are to expand the system to the point at 
which all influences on the object of interest become causal, it is natural that 
the physics of small systems is invariably probabilistic in nature. Quantum 
mechanics describes the dynamics of all systems, great and small. Rather 
than making firm predictions, it enables us to calculate probabilities. If the 
system is massive, the probabilities of interest may be so near zero or unity 
that we have effective certainty. If the system is small, the probabilistic 
aspect of the theory will be more evident. 

The scale of atoms is precisely the scale on which the probabilistic aspect 
is predominant. Its predominance reflects two facts. First, there is no such 
thing as an isolated atom because all atoms are inherently coupled to the 
electromagnetic field, and to the fields associated with electrons, neutrinos, 
quarks, and various ‘gauge bosons’. Since we have incomplete information 
about the states of these fields, we cannot hope to make precise predictions 
about the behaviour of an individual atom. Second, we cannot build mea¬ 
suring instruments of arbitrary delicacy. The instruments we use to measure 
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atoms are usually themselves made of atoms, and employ electrons or pho¬ 
tons that carry sufficient energy to change an atom significantly. We rarely 
know the exact state that our measuring instrument is in before we bring it 
into contact with the system we have measured, so the result of the measure¬ 
ment of the atom would be uncertain even if we knew the precise state that 
the atom was in before we measured it, which of course we do not. More¬ 
over, the act of measurement inevitably disturbs the atom, and leaves it in a 
different state from the one it was in before we made the measurement. On 
account of the uncertainty inherent in the measuring process, we cannot be 
sure what this final state may be. Quantum mechanics allows us to calculate 
probabilities for each possible final state. Perhaps surprisingly, from the the¬ 
ory it emerges that even when we have the most complete information about 
the state of a system that is is logically possible to have, the outcomes of 
some measurements remain uncertain. Thus whereas in the classical world 
uncertainties can be made as small as we please by sufficiently careful work, 
in the quantum world uncertainty is woven into the fabric of reality. 


1.1 The laws of probability 

Events are frequently one-offs: Pretty Lady will run in the 4.20 at Sandown 
Park only once this year, and if she enters the race next year, her form and 
the field will be different. The probability that we want is for this year’s 
race. Sometimes events can be repeated, however. For example, there is 
no obvious difference between one throw of a die and the next throw, so 
it makes sense to assume that the probability of throwing a 5 is the same 
on each throw. When events can be repeated in this way we seek to assign 
probabilities in such a way that when we make a very large number N of 
trials, the number n A of trials in which event A occurs (for example 5 comes 
up) satisfies 

n A — PaN- ( 1 - 1 ) 

In any realistic sequence of throws, the ratio tia/N will vary with N, while 
the probability p A does not. So the relation (1.1) is rarely an equality. The 
idea is that we should choose p A so that n A /N fluctuates in a smaller and 
smaller interval around p A as N is increased. 

Events can be logically combined to form composite events: if A is the 
event that a certain red die falls with 1 up, and B is the event that a white 
die falls with 5 up, AB is the event that when both dice are thrown, the red 
die shows 1 and the white one shows 5. If the probability of A is p A and the 
probability of B is ps , then in a fraction ~ p A of throws of the two dice the 
red die will show 1, and in a fraction ~ pb of these throws, the white die 
will have 5 up. Hence the fraction of throws in which the event AB occurs is 
~ PaPb so we should take the probability of AB to be pab = PaPb- In this 
example A and B are independent events because we see no reason why 
the number shown by the white die could be influenced by the number that 
happens to come up on the red one, and vice versa. The rule for combining 
the probabilities of independent events to get the probability of both events 
happening, is to multiply them: 

p(A and B) = p(A)p(B) (independent events). (1.2) 

Since only one number can come up on a die in a given throw, the 
event A above excludes the event C that the red die shows 2; A and C are 
exclusive events. The probability that either a 1 or a 2 will show is obtained 
by adding p A and pc ■ Thus 

p(A or C) = p(A) + p(C) (exclusive events). (1.3) 


In the case of reproducible events, this rule is clearly consistent with the 
principle that the fraction of trials in which either A or C occurs should be 
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the sum of the fractions of the trials in which one or the other occurs. If 
we throw our die, the number that will come up is certainly one of 1, 2, 3, 
4, 5 or 6. So by the rule just given, the sum of the probabilities associated 
with each of these numbers coming up has to be unity. Unless we know that 
the die is loaded, we assume that no number is more likely to come up than 
another, so all six probabilities must be equal. Hence, they must all equal 
g. Generalising this example we have the rules 


N 

With just N mutually exclusive outcomes, E Pi = l. 

i=1 

If all outcomes are equally likely, p t = 1 /N. 


(1.4) 


1.1.1 Expectation values 

A random variable £ is a quantity that we can measure and the value that 
we get is subject to uncertainty. Suppose for simplicity that only discrete 
values Xi can be measured. In the case of a die, for example, x could be the 
number that comes up, so x has six possible values, xi = 1 to xe = 6. If pi 
is the probability that we shall measure ay, then the expectation value of 
x is 

( x) = ^2 piXi. ( 1 . 5 ) 

i 

If the event is reproducible, it is easy to show that the average of the values 
that we measure on N trials tends to (a:) as N becomes very large. Conse¬ 
quently, (a:) is often referred to as the average of x. 

Suppose we have two random variables, x and y. Let p l3 be the proba¬ 
bility that our measurement returns Xi for the value of x and y 3 for the value 
of y. Then the expectation of the sum x + y is 

(x + y) = J^PiA^ + Vo) = +J2 p dVj ( 1 - 6 ) 

ij ij ij 


But Pij is the probability that we measure Xi regardless of what we 
measure for y, so it must equal p^. Similarly 'Y2 i Pij = Pj, the probability of 
measuring yj irrespective of what we get for x. Inserting these expressions 
in to (1.6) we find 

{x + y) = {x) + (y) . (1.7) 

That is, the expectation value of the sum of two random variables is the 
sum of the variables’ individual expectation values, regardless of whether 
the variables are independent or not. 

A useful measure of the amount by which the value of a random variable 
fluctuates from trial to trial is the variance of x: 

{{x - {x)) 2 ) = (x 2 ) -2{x {x)) + ((z) 2 ) , (1.8) 

where we have made use of equation (1.7). The expectation (x) is not a 
random variable, but has a definite value. Consequently {x (x)) = {x) 2 and 
( 1 {x) 2 ^ = (x ) 2 , so the variance of x is related to the expectations of x and 
x 2 by 

(A^> = {(x — (x)) 2 ) = (x 2 ) — (x) 2 . 


(1.9) 
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^>y ^- l- 

Figure 1.1 The two-slit interference experiment. 


1.2 Probability amplitudes 

Many branches of the social, physical and medical sciences make extensive 
use of probabilities, but quantum mechanics stands alone in the way that it 
calculates probabilities, for it always evaluates a probability p as the mod- 
square of a certain complex number A: 

p=\A\\ (1.10) 

The complex number A is called the probability amplitude for p. 

Quantum mechanics is the only branch of knowledge in which proba¬ 
bility amplitudes appear, and nobody understands why they arise. They 
give rise to phenomena that have no analogues in classical physics through 
the following fundamental principle. Suppose something can happen by two 
(mutually exclusive) routes, S or T, and let the probability amplitude for it 
to happen by route S be A(S) and the probability amplitude for it to happen 
by route T be A(T). Then the probability amplitude for it to happen by one 
route or the other is 


A{S orT)=A(S) + A(T). (1.11) 

This rule takes the place of the sum rule for probabilities, equation (1.3). 
However, it is incompatible with equation (1.3), because it implies that the 
probability that the event happens regardless of route is 

p(S or T) = \A(S or T )| 2 = |A(S) + A(T )| 2 

= lA(S')! 2 + A(S)A*(T) + A*(S)A(T) + \A(T )\ 2 (1.12) 

= p(S) + p(T) + 25ft e{A{S)A*{T)). 

That is, the probability that an event will happen is not merely the sum 
of the probabilities that it will happen by each of the two possible routes: 
there is an additional term 25fte(H(5)H*(T)). This term has no counterpart 
in standard probability theory, and violates the fundamental rule (1.3) of 
probability theory. It depends on the phases of the probability amplitudes 
for the individual routes, which do not contribute to the probabilities p(S) = 
|H(S)| 2 of the routes. 

Whenever the probability of an event differs from the sum of the prob¬ 
abilities associated with the various mutually exclusive routes by which it 
can happen, we say we have a manifestation of quantum interference. 
The term 25fte(H(S')H*(T)) in equation (1.12) is what generates quantum 
interference mathematically. We shall see that in certain circumstances the 
violations of equation (1.3) that are caused by quantum interference are not 
detectable, so standard probability theory appears to be valid. 

How do we know that the principle (1.11), which has these extraordinary 
consequences, is true? The soundest answer is that it is a fundamental 
postulate of quantum mechanics, and that every time you look at a digital 
watch, or touch a computer keyboard, or listen to a CD player, or interact 
with any other electronic device that has been engineered with the help 
of quantum mechanics, you are testing and vindicating this theory. Our 
civilisation now quite simply depends on the validity of equation (1.11). 
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Figure 1.2 The probability distribu¬ 
tions of passing through each of the 
two closely spaced slits overlap. 


1.2.1 Two-slit interference 

An imaginary experiment will clarify the physical implications of the prin¬ 
ciple and suggest how it might be tested experimentally. The apparatus 
consists of an electron gun, G, a screen with two narrow slits Si and S 2 , 
and a photographic plate P, which darkens when hit by an electron (see 
Figure 1.1). 

When an electron is emitted by G, it has an amplitude to pass through 
slit Si and then hit the screen at the point x. This amplitude will clearly 
depend on the point x, so we label it Ai ( x ). Similarly, there is an amplitude 
A 2 (x) that the electron passed through S 2 before reaching the screen at x. 
Hence the probability that the electron arrives at x is 

P(x) = \Ax(x) + A 2 {x)\ 2 = \Ai(x)\ 2 + |A 2 (a:)| 2 + 25ie(Ai ( 2 ^ 2 ( 21 )). (1.13) 

|Ai(a;)| 2 is simply the probability that the electron reaches the plate after 
passing through Si. We expect this to be a roughly Gaussian distribution 
Pi(x) that is centred on the value X\ of x at which a straight line from G 
through the middle of Si hits the plate. |A 2 (a:)| 2 should similarly be a roughly 
Gaussian function p 2 (x) centred on the intersection at x 2 of the screen and 
the straight line from G through the middle of S 2 . It is convenient to write 
Ai = |Aj|e 1< ^’ i = y/pie 1 ^, where </>* is the phase of the complex number Aj. 
Then equation (1.13) can be written 

p(x) = p!{x)+p 2 {x) + I(x), (1.14a) 

where the interference term I is 

I{x) = 2\/p 1 (x)p 2 (x) cos(^i(a;) - (j> 2 (x)). (1.14b) 

Consider the behaviour of I(x) near the point that is equidistant from the 
slits. Then (see Figure 1.2) pi ~ p 2 and the interference term is comparable 
in magnitude to p\ + p 2 , and, by equations (1-14), the probability of an 
electron arriving at x will oscillate between ~ 2pi and 0 depending on the 
value of the phase difference <f>i(x) — (j> 2 {x). In §2.3.4 we shall show that the 
phases <pi(x) are approximately linear functions of x, so after many electrons 
have been fired from G to P in succession, the blackening of P at x, which 
will be roughly proportional to the number of electrons that have arrived at 
x, will show a sinusoidal pattern. 

Let’s replace the electrons by machine-gun bullets. Then everyday ex¬ 
perience tells us that classical physics applies, and it predicts that the prob¬ 
ability p(x) of a bullet arriving at x is just the sum p\(x) + p 2 (x) of the 
probabilities of a bullet coming through Si or S 2 . Hence classical physics 
does not predict a sinusoidal pattern in p(x). How do we reconcile the very 
different predictions of classical and quantum mechanics? Firearms manufac¬ 
turers have for centuries used classical mechanics with deadly success, so is 
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the resolution that bullets do not obey quantum mechanics? We believe they 
do, and the probability distribution for the arrival of bullets should show a 
sinusoidal pattern. However, in §2.3.4 we shall find that quantum mechanics 
predicts that the distance A between the peaks and troughs of this pattern 
becomes smaller and smaller as we increase the mass of the particles we are 
firing through the slits, and by the time the particles are as massive as a 
bullet, A is fantastically small ~ 10~ 29 m. Consequently, it is not exper¬ 
imentally feasible to test whether p{x) becomes small at regular intervals. 
Any feasible experiment will probe the value of p(x) averaged over many 
peaks and troughs of the sinusoidal pattern. This averaged value of p{x) 
agrees with the probability distribution we derive from classical mechanics 
because the average value of I(x) in equation (1.14) vanishes. 

1.2.2 Matter waves? 

The sinusoidal pattern of blackening on P that quantum mechanics predicts 
proves to be identical to the interference pattern that is observed in Young’s 
double-slit experiment. This experiment established that light is a wave phe¬ 
nomenon because the wave theory could readily explain the existence of the 
interference pattern. It is natural to infer from the existence of the sinusoidal 
pattern in the quantum-mechanical case, that particles are manifestations of 
waves in some medium. There is much truth in this inference, and at an 
advanced level this idea is embodied in quantum field theory. However, in 
the present context of non-relativistic quantum mechanics, the concept of 
matter waves is unhelpful. Particles are particles, not waves, and they pass 
through one slit or the other. The sinusoidal pattern arises because proba¬ 
bility amplitudes are complex numbers, which add in the same way as wave 
amplitudes. Moreover, the energy density (intensity) associated with a wave 
is proportional to the mod square of the wave amplitude, just as the proba¬ 
bility density of finding a particle is proportional to the mod square of the 
probability amplitude. Hence, on a mathematical level, there is a one-to-one 
correspondence between what happens when particles are fired towards a 
pair of slits and when light diffracts through similar slits. But we cannot 
consistently infer from this correspondence that particles are manifestations 
of waves because quantum interference occurs in quantum systems that are 
much more complex than a single particle, and indeed in contexts where 
motion through space plays no role. In such contexts we cannot ascribe the 
interference phenomenon to interference between real physical waves, so it is 
inconsistent to take this step in the case of single-particle mechanics. 


1.3 Quantum states 

1.3.1 Quantum amplitudes and measurements 

Physics is about the quantitative description of natural phenomena. A quan¬ 
titative description of a system inevitably starts by defining ways in which 
it can be measured. If the system is a single particle, quantities that we can 
measure are its x, y and 2 coordinates with respect to some choice of axes, 
and the components of its momentum parallel to these axes. We can also 
measure its energy, and its angular momentum. The more complex a system 
is, the more ways there will be in which we can measure it. 

Associated with every measurement, there will be a set of possible nu¬ 
merical values for the measurement - the spectrum of the measurement. 
For example, the spectrum of the x coordinate of a particle in empty space 
is the interval (— 00 , 00 ), while the spectrum of its kinetic energy is (0, 00 ). 
We shall encounter cases in which the spectrum of a measurement con¬ 
sists of discrete values. For example, in Chapter 7 we shall show that 
the angular momentum of a particle parallel to any given axis has spec¬ 
trum (..., (k — 1 )h,kh,(k + 1 where Ti is Planck’s constant h = 
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6.63 x 10 -34 J s divided by 2n , and k is either 0 or i. When the spectrum is 
a set of discrete numbers, we say that those numbers are the allowed values 
of the measurement. 

With every value in the spectrum of a given measurement there will be 
a quantum amplitude that we will find this value if we make the relevant 
measurement. Quantum mechanics is the science of how to calculate such 
amplitudes given the results of a sufficient number of prior measurements. 

Imagine that you’re investigating some physical system: some particles 
in an ion trap, a drop of liquid helium, the electromagnetic field in a resonant 
cavity. What do you know about the state of this system? You have two types 
of knowledge: ( 1 ) a specification of the physical nature of the system (e.g., 
size & shape of the resonant cavity), and ( 2 ) information about the current 
dynamical state of the system. In quantum mechanics information of type 
(1) is used to define an object called the Hamiltonian H of the system that 
is defined by equation (2.5) below. Information of type (2) is more subtle. 
It must consist of predictions for the outcomes of measurements you could 
make on the system. Since these outcomes are inherently uncertain, your 
information must relate to the probabilities of different outcomes, and in the 
simplest case consists of values for the relevant probability amplitudes. For 
example, your knowledge might consist of amplitudes for the various possible 
outcomes of a measurement of energy, or of a measurement of momentum. 

In quantum mechanics, then, knowledge about the current dynamical 
state of a system is embodied in a set of quantum amplitudes. In classical 
physics, by contrast, we can state with certainty which value we will measure, 
and we characterise the system’s current dynamical state by simply giving 
this value. Such values are often called ‘coordinates’ of the system. Thus 
in quantum mechanics a whole set of quantum amplitudes replaces a single 
number. 

Complete sets of amplitudes Given the amplitudes for a certain set of 
events, it is often possible to calculate amplitudes for other events. The phe¬ 
nomenon of particle spin provides the neatest illustration of this statement. 

Electrons, protons, neutrinos, quarks, and many other elementary par¬ 
ticles turn out to be tiny gyroscopes: they spin. The rate at which they 
spin and therefore the the magnitude of their spin angular momentum never 
changes; it is always ^3/4 Ti. Particles with this amount of spin are called 
spin-half particles for reasons that will emerge shortly. Although the spin 
of a spin-half particle is fixed in magnitude, its direction can change. Conse¬ 
quently, the value of the spin angular momentum parallel to any given axis 
can take different values. In §7.4.2 we shall show that parallel to any given 
axis, the spin angular momentum of a spin-half particle can be either 
Consequently, the spin parallel to the z axis is denoted s z 7i, where s z = 
is an observable with the spectrum {—i, i}. 

In §7.4.2 we shall show that if we know both the amplitude a+ that s z 
will be measured to be +5 and the amplitude a_ that a measurement will 
yield s z = — then we can calculate from these two complex numbers the 
amplitudes b + and for the two possible outcomes of the measurement of 
the spin along any direction. If we know only a+ (or only a_), then we can 
calculate neither nor b _ for any other direction. 

Generalising from this example, we have the concept of a complete 
set of amplitudes: the set contains enough information to enable one 
to calculate amplitudes for the outcome of any measurement whatsoever. 
Hence, such a set gives a complete specification of the physical state of the 
system. A complete set of amplitudes is generally understood to be a minimal 
set in the sense that none of the amplitudes can be calculated from the others. 
The set {a_, a+} constitutes a complete set of amplitudes for the spin of an 
electron. 
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1.3.2 Dirac notation 

Dirac introduced the symbol \ip), pronounced ‘ket psi’, to denote a complete 
set of amplitudes for the system. If the system consists of a particle 1 trapped 
in a potential well, \ip) could consist of the amplitudes a n that the energy 
is E n , where (Ei, E 2 , ■. .) is the spectrum of possible energies, or it might 
consist of the amplitudes ip(x) that the particle is found at x, or it might 
consist of the amplitudes a(p) that the momentum is measured to be p. 
Using the abstract symbol | i/j) enables us to think about the system without 
committing ourselves to what complete set of amplitudes we are going to 
use, in the same way that the position vector x enables us to think about 
a geometrical point independently of the coordinates ( x,y,z ), (r, #,</>) or 
whatever by which we locate it. That is, |i/>) is a container for a complete set 
of amplitudes in the same way that a vector x is a container for a complete 
set of coordinates. 

The ket \ip) encapsulates the crucial concept of a quantum state, which 
is independent of the particular set of amplitudes that we choose to quantify 
it, and is fundamental to several branches of physics. 

We saw in the last section that amplitudes must sometimes be added: if 
an outcome can be achieved by two different routes and we do not monitor 
the route by which it is achieved, we add the amplitudes associated with each 
route to get the overall amplitude for the outcome. In view of this additivity, 
we write 

|V>3) = |Vd) + 1^2) (1-15) 

to mean that every amplitude in the complete set |^> 3 ) is the sum of the 
corresponding amplitudes in the complete sets |Vh) and |-i/; 2 }. This rule is 
exactly analogous to the rule for adding vectors because b 3 = b 3 + b 2 implies 
that each component of b 3 is the sum of the corresponding components of 
bi and b 2 . 

Since amplitudes are complex numbers, for any complex number a we 
can define 

W) = a\ijj) (1.16) 

to mean that every amplitude in the set \ip') is a times the corresponding 
amplitude in | if)}. Again there is an obvious parallel in the case of vectors: 
3b is the vector that has x component 3 b x , etc. 


1.3.3 Vector spaces and their adjoints 

The analogy between kets and vectors proves extremely fruitful and is worth 
developing. For a mathematician, objects, like kets, that you can add and 
multiply by arbitrary complex numbers inhabit a vector space. Since we 
live in a (three-dimensional) vector space, we have a strong intuitive feel for 
the structures that arise in general vector spaces, and this intuition helps 
us to understand problems that arise with kets. Unfortunately our every¬ 
day experience does not prepare us for an important property of a general 
vector space, namely the existence of an associated ‘adjoint’ space, because 
the space adjoint to real three-dimensional space is indistinguishable from 
real space. In quantum mechanics and in relativity the two spaces are dis¬ 
tinguishable. We now take a moment to develop the mathematical theory 
of general vector spaces in the context of kets in order to explain the re¬ 
lationship between a general vector space and its adjoint space. When we 
are merely using kets as examples of vectors, we shall call them “vectors”. 
Appendix G explains how these ideas are relevant to relativity. 

1 Most elementary particles have intrinsic angular momentum or ‘spin’ (§7.4). A com¬ 
plete set of amplitudes for a particle such as electron or proton that has spin, includes 
information about the orientation of the spin. In the interests of simplicity, in our discus¬ 
sions particles are assumed to have no spin unless the contrary is explicitly stated, even 
though spinless particles are rather rare. 
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For any vector space V it is natural to choose a set of basis vectors, 
that is, a set of vectors \i) that is large enough for it to be possible to 
express any given vector | ip) as a linear combination of the set’s members. 
Specifically, for any ket | ip) there are complex numbers at such that 


i 


(1.17) 


The set should be minimal in the sense that none of its members can be 
expressed as a linear combination of the remaining ones. In the case of ordi¬ 
nary three-dimensional space, basis vectors are provided by the unit vectors 
i, j and k along the three coordinate axes, and any vector b can be expressed 
as the sum b = aii + a 2 j + 03 k, which is the analogue of equation (1.17). 

In quantum mechanics an important role is played by complex-valued 
linear functions on the vector space V because these functions extract the 
amplitude for something to happen given that the system is in the state | ip). 
Let (/| (pronounced 'bra f’) be such a function. We denote by (f\ip) the 
result of evaluating this function on the ket | ip). Hence, (/ \i/j) is a complex 
number (a probability amplitude) that in the ordinary notation of functions 
would be written f The linearity of the function (/1 implies that for 

any complex numbers a, (3 and kets | ip), |</>), it is true that 

(/l(o#) +P\</>)) = a(f\ip) + fi(f\4>). (1.18) 

Notice that the right side of this equation is a sum of two products of complex 
numbers, so it is well defined. 

To define a function on V we have only to give a rule that enables us 
to evaluate the function on any vector in V. Hence we can define the sum 
(h\ = (/| + (g\ of two bras (/| and (g| by the rule 

</#> = (fW + (1.19) 

Similarly, we define the bra ( p\ = a(f | to be result of multiplying (/| by 
some complex number a through the rule 

( 2 #> = a(f\^). ( 1 . 20 ) 

Since we now know what it means to add these functions and multiply them 
by complex numbers, they form a vector space V', called the adjoint space 
of V. 

The dimension of a vector space is the number of vectors required to 
make up a basis for the space. We now show that V and V' have the same 
dimension. Let 2 (|i)} for i = 1,7V be a basis for V. Then a linear function 
(/| on V is fully defined once we have given the N numbers (f\i). To see 
that this is true, we use (1.17) and the linearity of (/| to calculate (/ \tp) for 
an arbitrary vector | ip) = JT a,:|«): 


N 

</W = £«i</N>. ( L21 ) 

»=1 

This result implies that we can define N functions (j\ (j = 1,1V) through 
the equations 

(j\i)=S t ,, (1.22) 

where Sij is 1 if i = j and zero otherwise, because these equations specify the 
value that each bra (j\ takes on every basis vector |i) and therefore through 


2 Throughout this book the notation {xi} means ‘the set of objects xd. 



1.3 Quantum states 


11 


(1.21) the value that (j | takes on any vector Now consider the following 
linear combination of these bras: 


N 

(F\ = ^2(f\j)(j\. (1.23) 

3=1 

It is trivial to check that for any i we have (F|i) = (f\i), and from this 
it follows that (F\ = (/| because we have already agreed that a bra is fully 
specified by the values it takes on the basis vectors. Since we have now shown 
that any bra can be expressed as a linear combination of the N bras specified 
by (1.22), and the latter are manifestly linearly independent, it follows that 
the dimensionality of V' is N, the dimensionality of V. 

In summary, we have established that every iV-dimensional vector space 
V comes with an iV-dimensional space V' of linear functions on V, called the 
adjoint space. Moreover, we have shown that once we have chosen a basis 
(|*)} for V, there is an associated basis {(«|} for V'. Equation (1.22) shows 
that there is an intimate relation between the ket |i) and the bra (i|: (i\i) = 1 
while (j\i) = 0 for j ^ i. We acknowledge this relationship by saying that (i| 
is the adjoint of |*). We extend this definition of an adjoint to an arbitrary 
ket \ip) as follows: if 


I i>) = E fli l i) then (V'l = Yl a i^ i- 

i i 


(1.24) 


With this choice, when we evaluate the function (ip | on the ket | ip) we find 

(ip\ip) = = En 2 ^ 0 - ( L25 ) 

i 3 i 

Thus for any state the number ( ip\ip) is real and non-negative, and it can 
vanish only if | ip) = 0 because every at vanishes. We call this number the 
length of \ip). 

The components of an ordinary three-dimensional vector b = b x i + 
b y i + b z k are real. Consequently, we evaluate the length-square of b as 
simply (b x i + b y j + b z k) • (b x i + b y j + b z k) = b x + by + b The vector on the 
extreme left of this expression is strictly speaking the adjoint of b but it is 
indistinguishable from it because we have not modified the components in 
any way. In the quantum mechanical case eq. 1.25, the components of the 
adjoint vector are complex conjugates of the components of the vector, so 
the difference between a vector and its adjoint is manifest. 

If | <p) = JT bi\i) and | ip) = : a»|i) are any two states, a calculation 

analogous to that in equation (1.25) shows that 


Similarly, we can show that (ip\<p) = a*bi, and from this it follows that 


(V#> = 


(1.27) 


We shall make frequent use of this equation. 

Equation (1.26) shows that there is a close connection between extract¬ 
ing the complex number (<p\ip) from (<p\ and | ip) and the operation of taking 
the dot product between two vectors b and a. 
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1.3.4 The energy representation 

Suppose our system is a particle that is trapped in some potential well. Then 
the spectrum of allowed energies will be a set of discrete numbers E 0 , E 1 , ... 
and a complete set of amplitudes are the amplitudes a* whose mod squares 
give the probabilities pt of measuring the energy to be Ei. Let (|«)} be a set 
of basis kets for the space V of the system’s quantum states. Then we use 
the set of amplitudes a, to associate them with a ket | ip) through 


W) =J2a,i\i). 

i 


(1.28) 


This equation relates a complete set of amplitudes {a{\ to a certain ket 
We discover the physical meaning of a particular basis ket, say | k), by 
examining the values that the expansion coefficients at take when we apply 
equation (1.28) in the case |fc) = |?/>). We clearly then have that a, = 0 for 
i ^ k and Ofc = 1. Consequently, the quantum state | k) is that in which 
we are certain to measure the value Ek for the energy. We say that | k) is 
a state of well defined energy. It will help us remember this important 
identification if we relabel the basis kets, writing | Ei) instead of just |*), so 
that (1.28) becomes 

W) =Y^ a i\ E i)- (1-29) 

i 

Suppose we multiply this equation through by (Ek |. Then by the lin¬ 
earity of this operation and the orthogonality relation (1.22) (which in our 
new notation reads (Ek\Ei) — Sik) we find 


afc = (E k \ip). 


(1.30) 


This is an enormously important result because it tells us how to extract from 
an arbitrary quantum state | if>) the amplitude for finding that the energy is 

Ek- 

Equation (1.25) yields 

('*/#) = N 2 = Yl Pi = lj ( L31 ) 

i i 


where the last equality follows because if we measure the energy, we must 
find some value, so the probabilities pi must sum to unity. Thus kets that 
describe real quantum states must have unit length: we call kets with unit 
length properly normalised. During calculations we frequently encounter 
kets that are not properly normalised, and it is important to remember that 
the key rule (1.30) can be used to extract predictions only from properly 
normalised kets. Fortunately, any ket \(p) = JT bt\i) is readily normalised: it 
is straightforward to check that 




t vW> 


(1.32) 


is properly normalised regardless of the values of the bi. 


1.3.5 Orientation of a spin-half particle 

Formulae for the components of the spin angular momentum of a spin-half 
particle that we shall derive in §7.4.2 provide a nice illustration of how the 
abstract machinery just introduced enables us to predict the results of ex¬ 
periments. 

If you measure one component, say s z , of the spin s of an electron, you 
will obtain one of two results, either s z = I or s z = — Moreover the state 
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|+) in which a measurement of s z is certain to yield ^ and the state |—) in 
which the measurement is certain to yield — i form a complete set of states 
for the electron’s spin. That is, any state of spin can be expressed as a linear 
combination of |+) and |—): 

IV'} = a-|—} + a+|+). (1.33) 


Let n be the unit vector in the direction with polar coordinates (0, </>). 
Then the state |+,n) in which a measurement of the component of s along 
n is certain to return i turns out to be (Problem 7.6) 


|+, n) = 8111 ( 0 / 2 ) e 1 ^/ 2 !—) + cos(0/2) e 1<?i / 2 |+). 


(1.34a) 


Similarly the state | —, n) in which a measurement of the component of s 
along n is certain to return — i is 


—, n) = cos(0/2) e 1 ^ 2 !—) — sin(0/2) e 1<?i / 2 |+). 


(1.34b) 


By equation (1.24) the adjoints of these kets are the bras 

(+, n| = sin(0/2) e _1<?i / 2 (—| + cos(0/2) e 1<?i / 2 (+| 
(—, n| = cos(0/2) e _1< ^/ 2 (— | — sin(0/2) 


(1.35) 


From these expressions it is easy to check that the kets |±,n) are properly 
normalised and orthogonal to one another. 

Suppose we have just measured s z and found the value to be \ and we 
want the amplitude A _ (n) to find — 4 when we measure n • s. Then the state 
of the system is \i/j) = |+) and the required amplitude is 

-A-(n) = (—,n|^) = (—,n|+) = - sin(0/2)e“^ 2 , (1.36) 

so the probability of this outcome is 

P- (n) = |A_(n)| 2 = sin 2 (0/2). (1.37) 

This vanishes when 0 = 0 as it should since then n = (0,0,1) son-s = s 21 
and we are guaranteed to find s z = \ rather than — P_(n) rises to \ when 

0 = 7t/ 2 and n lies somewhere in the x,y plane. In particular, if s z = i, a 
measurement of s x is equally likely to return either of the two possible values 

± T 

Putting 0 = 7 t/2, (f> = 0 into equations (1-34) we obtain expressions for 
the states in which the result of a measurement of s x is certain 

!+,*> = ^<l->-H+» ; K*> = ^(l-}-l+»- (1-38) 

Similarly, inserting 0 = 7r/2, <f> = tt/2 we obtain the states in which the result 
of measuring s y is certain 


l+;2/) = V ( R“ i|+)) ; \-,y) = -^-(|-) + i|+))- (i-39) 


Notice that |+,x) and |+,j/) are both states in which the probability of 
measuring s z to be ^ is What makes them physically distinct states is 
that the ratio of the amplitudes to measure ±4 for s z is unity in one case 
and i in the other. 
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1.3.6 Polarisation of photons 

A discussion of the possible polarisations of a beam of light displays an 
interesting connection between quantum amplitudes and classical physics. 
At any instant in a polarised beam of light, the electric vector E is in one 
particular direction perpendicular to the beam. In a plane-polarised beam, 
the direction of E stays the same, while in a circularly polarised beam it 
rotates. A sheet of Polaroid transmits the component of E in one direction 
and blocks the perpendicular component. Consequently, in the transmitted 
beam |E| is smaller than in the incident beam by a factor cos 9, where 9 is 
the angle between the incident field and the direction in the Polaroid that 
transmits the field. Since the beam’s energy flux is proportional to |E| 2 , a 
fraction cos 2 9 of the beam’s energy is transmitted by the Polaroid. 

Individual photons either pass through the Polaroid intact or are ab¬ 
sorbed by it depending on which quantum state they are found to be in 
when they are ‘measured’ by the Polaroid. Let |—>) be the state in which the 
photon will be transmitted and |f) that in which it will be blocked. Then 
the photons of the incoming plane-polarised beam are in the state 

\ip) = cos0|—>■} + sin 0 It), (1-40) 

so each photon has an amplitude cu> = cos 9 for a measurement by the 
Polaroid to find it in the state |—>) and be transmitted, and an amplitude 
a-j- = sin0 to be found to be in the state |f) and be blocked. The fraction 
of the beam’s photons that are transmitted is the probability get through 
P-+ = | CL—y | 2 = cos 2 9. Consequently a fraction cos 2 9 of the incident energy 
is transmitted, in agreement with classical physics. 

The states |—>) and |f) form a complete set of states for photons that 
move in the direction of the beam. An alternative complete set of states is 
the set (|+), |—}} formed by the state |+) of a right-hand circularly polarised 
photon and the state |—) of a left-hand circularly polarised photon. In the 
laboratory a circularly polarised beam is often formed by passing a plane 
polarised beam through a birefringent material such as calcite that has its 
axes aligned at 45° to the incoming plane of polarisation. The incoming 
beam is resolved into its components parallel to the calcite’s axes, and one 
component is shifted in phase by ir/2 with respect to the other. In terms of 
unit vectors and e y parallel to the calcite’s axes, the incoming field is 

E=^{(e, + e. y )e- i - t } (1.41) 

and the outgoing field of a left-hand polarised beam is 

E - = ^3?{(e :c + i e y ) e - iwt }, (1.42a) 

while the field of a right-hand polarised beam would be 

E + = ^3?{(e,-ie y )e- iwt }. (1.42b) 

The last two equations express the electric field of a circularly polarised 
beam as a linear combination of plane polarised beams that differ in phase. 
Conversely, by adding (1.42b) to equation (1.42a), we can express the electric 
field of a beam polarised along the x axis as a linear combination of the fields 
of two circularly-polarised beams. 

Similarly, the quantum state of a circularly polarised photon is a linear 
superposition of linearly-polarised quantum states: 

l±) = 4(h) Tilt)), 


(1.43) 
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and conversely, a state of linear polarisation is a linear superposition of states 
of circular polarisation: 


h) = ^(l+} + |-})- (1-44) 

Whereas in classical physics complex numbers are just a convenient way of 
representing the real function cos (uit + </>) for arbitrary phase <j>, quantum 
amplitudes are inherently complex and the operator 5ft is not used. Whereas 
in classical physics a beam may be linearly polarised in a particular direction, 
or circularly polarised in a given sense, in quantum mechanics an individual 
photon has an amplitude to be linearly polarised in a any chosen direction 
and an amplitude to be circularly polarised in a given sense. The amplitude 
to be linearly polarised may vanish in one particular direction, or it may 
vanish for one sense of circular polarisation. In the general case the photon 
will have a non-vanishing amplitude to be polarised in any direction and any 
sense. After it has been transmitted by an analyser such as Polaroid, it will 
certainly be in whatever state the analyser transmits. 


1.4 Measurement 

Equation (1.28) expresses the quantum state of a system | ip) as a sum over 
states in which a particular measurement, such as energy, is certain to yield a 
specified value. The coefficients in this expansion yield as their mod-squares 
the probabilities with which the possible results of the measurement will be 
obtained. Hence so long as there is more than one term in the sum, the result 
of the measurement is in doubt. This uncertainty does not reflect shortcom¬ 
ings in the measuring apparatus, but is inherent in the physical situation - 
any defects in the measuring apparatus will increase the uncertainty above 
the irreducible minimum implied by the expansion coefficients, and in §6.3 
the theory will be adapted to include such additional uncertainty. 

Here we are dealing with ideal measurements, and such measurements 
are reproducible. Therefore, if a second measurement is made immediately 
after the first, the same result will be obtained. From this observation it 
follows that the quantum state of the system is changed by the first mea¬ 
surement from \i/j) = to | ip) = 1-0, where 1 1) is the state in which 

the measurement is guaranteed to yield the value that was obtained by the 
first measurement. The abrupt change in the quantum state from Xu a il*) 
to 1 1) that accompanies a measurement is referred to as the collapse of the 
wavefunction. 

What happens when the “wavefunction collapses”? It is tempting to 
suppose that this event is not a physical one but merely an updating of 
our knowledge of the system: that the system was already in the state 1 1) 
before the measurement, but we only became aware of this fact when the 
measurement was made. It turns out that this interpretation is untenable, 
and that wavefunction collapse is associated with a real physical disturbance 
of the system. This topic is explored further in §6.5. 

Problems 

1.1 What physical phenomenon requires us to work with probability am¬ 
plitudes rather than just with probabilities, as in other fields of endeavour? 

1.2 What properties cause complete sets of amplitudes to constitute the 
elements of a vector space? 

1.3 V' is the dual space of the vector space V. For a mathematician, what 
objects comprise V'l 
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1.4 In quantum mechanics, what objects are the members of the vector 
space VI Give an example for the case of quantum mechanics of a member 
of the dual space V' and explain how members of V' enable us to predict 
the outcomes of experiments. 

1.5 Given that \ip) = e 17r / s |a) +e 17r / 4 |6), express {ip | as a linear combination 
of (a| and (&|. 

1.6 What properties characterise the bra (a| that is associated with the ket 

|a>? 

1.7 An electron can be in one of two potential wells that are so close that 
it can “tunnel” from one to the other (see §5.2 for a description of quantum- 
nreclianical tunnelling). Its state vector can be written 

\iP)=a\A) + b\B), (1.45) 

where | A) is the state of being in the first well and | B) is the state of being in 
the second well and all kets are correctly normalised. What is the probability 
of finding the particle in the first well given that: (a) a = i/2; (b) b = e 171 ’; 
(c) b = i + iA/2? 

1.8 An electron can “tunnel” between potential wells that form a chain, so 
its state vector can be written 


OO 

M = X>I»>, (1.46a) 

— OO 


where | n) is the state of being in the n th 
right. Let 


= —(~ 

72 V 3 


well, where n increases from left to 


M/2 

e in7r 


(1.46b) 


a. What is the probability of finding the electron in the n th well? 

b. What is the probability of finding the electron in well 0 or anywhere to 
the right of it? 



2 

Operators, measurement and time 
evolution 


In the last chapter we saw that each quantum state of a system is represented 
by a point or ‘kef \ip) that lies in an abstract vector space. We saw that 
states for which there is no uncertainty in the value that will be measured 
for a quantity such as energy, form a set of basis states for this space - 
these basis states are analogous to the unit vectors i, j and k of ordinary 
vector geometry. In this chapter we develop these ideas further by showing 
how every measurable quantity such as position, momentum or energy is 
associated with an operator on state space. We shall see that the energy 
operator plays a special role in that it determines how a system’s ket l^} 
moves through state space over time. Using these operators we are able 
at the end of the chapter to study the dynamics of a free particle, and to 
understand how the uncertainties in the position and momentum of a particle 
are intimately connected with one another, and how they evolve in time. 


2.1 Operators 

A linear operator on the vector space V is an object Q that transforms 
kets into kets in a linear way. That is, if \i[>) is a ket, then \<j>) = Q\i/j) is 
another ket, and if |y) is a third ket and a and /? are complex numbers, we 
have 

Q(a\ip) + /3|x» = a(Q\ip)) + P(Q\x))- (2.1) 

Consider now the linear operator 

/=£im (2.2) 

i 


where (|i)} is any set of basis kets. / really is an operator because if we 
apply it to any ket \ip), we get a linear combination of kets, which must itself 
be a ket: 


Af) = £ !*)(#) = £((#)) H), 


(2.3) 



18 


Chapter 2: Operators, measurement and time evolution 


where we are able to move around freely because it’s just a complex 
number. To determine which ket I\ip) is, we substitute into (2.3) the expan¬ 
sion (1.17) of | ip) and use the orthogonality relation (1.22): 


i ^ j 

= J2 a i \*> = IV’)- 


(2.4) 


We have shown that / applied to an arbitrary ket \ip) yields that same ket. 
Hence I is the identity operator. We shall make extensive use of this fact. 
Consider now the operator 


H = Y J E i \E i ){E i \. (2.5) 

i 


This is the most important single operator in quantum mechanics. It is called 
the Hamiltonian in honour of W.R. Hamilton, who introduced its classical 
analogue. 1 We use H to operate on an arbitrary ket \ip) to form the ket 
H\ip), and then we bra through by the adjoint {ip\ of \ip). We have 

(TPm) = '£EME i )(E i \i>). ( 2 . 6 ) 


By equation (1.29) (Ei\ip) = ai, while by (1.24) (ip\Ei) = a*. Thus 

wm) = = Y,Pi E i = ( E ) • ( 2 - 7 ) 

i i 

Here is yet another result of fundamental importance: if we squeeze the 
Hamiltonian between a quantum state | ip) and its adjoint bra, we obtain the 
expectation value of the energy for that state. 

It is straightforward to generalise this result for the expectation value 
of the energy to other measurable quantities: if Q is something that we can 
measure (often called an observable) and its spectrum of possible values is 
{}, then we expand an arbitrary ket \ip) as a linear combination of states 
|gi) in which the value of Q is well defined, 

H) ( 2 - 8 ) 

i 

and with Q we associate the operator 

Q = (2-9) 

i 

Then (^>|Q|^>) is the expectation value of Q when our system is in the state 
IV’)- When the state in question is obvious from the context, we shall some¬ 
times write the expectation value of Q simply as ( Q). 

When a linear operator R turns up in any mathematical problem, it 
is generally expedient to investigate its eigenvalues and eigenvectors. An 
eigenvector is a vector that R simply rescales, and its eigenvalue is the 
rescaling factor. Thus, let |r) be an eigenvector of R, and r be its eigenvalue, 
then we have 

i?|r) = rjr). (2.10) 

1 William Rowan Hamilton (1805-1865) was a protestant Irishman who was appointed 
the Andrews’ Professor of Astronomy at Trinity College Dublin while still an undergrad¬ 
uate. Although he did not contribute to astronomy, he made important contributions to 
optics and mechanics, and to pure mathematics with his invention of quaternions, the first 
non-commutative algebra. 
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Box 2.1: Hermitian Operators 

Let Q be a Hermitian operator with eigenvalues qt and eigenvectors \qi). 
Then we bra the defining equation of \qi) through by (qk\, and bra the 
defining equation of \qk) through by {qt |: 

(■ <ik\Q\qi) = ft(fttlft) (ft|Q|ft=) = qkiqMk)- 

We next take the complex conjugate of the second equation from the first. 
The left side then vanishes because Q is Hermitian, so with equation 
(1.27) 

0 = (ft - <?fc)(ftc|ft). 

Setting k = i we find that ft = q* since (ft|ft) > 0. Hence the eigenvalues 
are real. When qi ^ q k , we must have (ftc|ft) = 0, so the eigenvectors 
belonging to distinct eigenvalues are orthogonal. 


What are the eigenvectors and eigenvalues of HI If we apply H to \E k ), we 
find 

H\E k ) =Y J E i \E i )(E i \E k ) = E k \E k ). (2.11) 

i 

So the eigenvectors of H are the states of well defined energy, and its eigen¬ 
values are the possible results of a measurement of energy. Clearly this 
important result generalises immediately to eigenvectors and eigenvalues of 
the operator Q that we have associated with an arbitrary observable. 

Consider the complex number (4>\Q\ip), where | <j>) and | ip) are two arbi¬ 
trary quantum states. After expanding the states in terms of the eigenvectors 
of Q, we have 

(0IW) = fkj>) = Yl b *i a i q o 6 H =J2 b *i qiai ( 2 - 12 ) 

^ i ' ^ j ' ij i 

Similarly, (^|Q|0) = JAa*ftfej. Hence so long as the spectrum {qi} of Q 
consists entirely of real numbers (which is physically reasonable), then 

MQW)Y = (2-13) 

for any two states \<p) and | ip). An operator with this property is said to 
be Hermitian. Hermitian operators have nice properties. In particular, 
one can prove - see Box 2.1 - that they have real eigenvalues and mutually 
orthogonal eigenvectors, and it is because we require these properties on 
physical grounds that the operators of observables turn out to be Hermitian. 
In Chapter 4 we shall find that Hermitian operators arise naturally from 
another physical point of view. 

Although the operators associated with observables are always Hermi¬ 
tian, operators that are not Hermitian turn out to be extremely useful. With 
a non-Hermitian operator R we associate another operator R) called its Her¬ 
mitian adjoint by requiring that for any states \<j>) and | ip) it is true that 

= wm- ( 2 - 14 ) 

Comparing this equation with equation (2.13) it is clear that a Hermitian 
operator Q is its own adjoint: = Q. 

By expanding the kets \<f>) and | ijj) in the equation | <f>) = R\ip) as sums of 
basis kets, we show that R is completely determined by the array of numbers 
(called matrix elements) 


Ri] = {i\ R \j)- 


( 2 . 15 ) 
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Table 2.1 Rules for Hermitian adjoints 


Object 

i |V>) R 

QR 

m 

wm 

Adjoint 

-i (i/>\ i?t 

R^Q^ 

MR' 



In fact _ 

\<t>) = E bi i*) = R \^ = E a o R \j) 

j (2.16) 
=> bi = 'y ' cij(i\R\j) = y ' Rijdj. 

j 3 

If in equation (2.14) we set | <f>) = |z) and \tp) = |j), we discover the 
relation between the matrix of R and that of R': 

{R\ j y=R ji <> R\ j = RT Ji . (2.17) 

Hence the matrix of R^ is the complex-conjugate transpose of the matrix 
for R. If R is Hermitian so that R' = R , the matrix Rij must equal its 
complex-conjugate transpose, that is, it must be an Hermitian matrix. 

Operators can be multiplied together: when the operator QR operates 
on \-ijj), the result is what you get by operating first with R and then applying 
Q to R\ip). We shall frequently need to find the Hermitian adjoints of such 
products. To find out how to do this we replace R in (2.17) by QR: 

(< Q R % = (Q R )ji = E QiA = E R lQlj = ( R 'Q%- ( 2 - 18 ) 

k k 

Thus, to dagger a product we reverse the terms and dagger the individual 
operators. By induction it is now easy to show that 

{ABC...Zy =Z t ...C t H t H t . (2.19) 

If we agree that the Hermitian adjoint of a complex number is its com¬ 
plex conjugate and that |? j/)' = {tp\ and = \i/j), then we can consider the 
basic rule (2.14) for taking the complex conjugate of a matrix element to be 
a generalisation of the rule we have derived about reversing the order and 
daggering the components of a product of operators. The rules for taking 
Hermitian adjoints are summarised in Table 2.1. 

Functions of operators We shall frequently need to evaluate functions 
of operators. For example, the potential energy of a particle is a function 
V (x) of the position operator x. Let / be any function of one variable and 
R be any operator. Then we define the operator f(R) by the equation 

/GR) = E/(^) InXnl, (2.20) 


where the r,; and |r*) are the eigenvalues and eigenkets of R. This definition 
defines f(R) to be the operator that has the same eigenkets as R and the 
eigenvalues that you get by evaluating the function / on the eigenvalues of 
R. 

Commutators The commutator of two operators A, B is defined to be 

[A, B] = AB — BA. (2.21) 

If [A, B] y 0, it is impossible to find a complete set of mutual eigenkets of A 
and B (Problem 2.19). Conversely, it can be shown that if [A, B] = 0 there 
is a complete set of mutual eigenkets of A and B, that is, there is a complete 
set of states of the system in which there is no uncertainty in the value that 
will be obtained for either A or B. We shall make extensive use of this fact. 
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Notice that the word complete appears in both these statements; even in the 
case [A, B] ^ 0 it may be possible to find states in which both A and B 
have definite values. It is just that such states cannot form a complete set. 
Similarly, when [ A , B] = 0 there can be states for which A has a definite 
value but B does not. The literature is full of inaccurate statements about 
the implications of [A, B] being zero or non-zero. 

Three invaluable rules are 

[A + B,C] = [A, C] + [B, C] 

AB = BA + [A, B] (2.22) 

[AB,C] = [A,C]B + A[B,C]. 

All three rules are trivial to prove by explicitly writing out the contents of 
the square brackets. With these rules it is rarely necessary to write out the 
contents of a commutator again, so they eliminate a common source of error 
and tedium in calculations. Notice the similarity of the third rule to the 
standard rule for differentiating a product: d(a6)/dc = (da/dc)6 + a(d6/dc). 
The rule is easily generalised by induction to the rule 

[ABC ..., Z\ = [A, Z]BC...+ A[B, Z]C... + AB[C, Z}... (2.23) 

We shall frequently need to evaluate the commutator of an operator 
A with a function / of an operator B. We assume that / has a convergent 
Taylor series 2 / = /o+/' B+\f"B 2 +• • •, where f 0 = /(0), /' = (d/(x)/dx) 0 , 
etc., are numbers. Then 

[A, f(B)\ = /'[A, B\ + If"{[A, B}B + B[A , B]) 

+ Iff'"([A, B]B 2 + B[A, B\B + B 2 [A , B]) H- ' 1 

In the important case in which B commutes with [A, B\, this expression 
simplifies dramatically 

[A, f{B)\ = [A, B}(f + f"B + \f"B 2 + •■■) = [A, B]^. (2.25) 

We shall use this formula several times. 


2.2 Evolution in time 

Since physics is about predicting the future, equations of motion lie at its 
heart. Newtonian dynamics is dominated by the equation of motion f = 
ma, where f is the force on a particle of mass m and a is the resulting 
acceleration. In quantum mechanics the analogous dynamical equation is 

the time-dependent Schrodinger equation (TDSE): 3 


i Ti 


m 

dt 




(2.26) 


For future reference we use the rules of Table 2.1 to derive from this equation 
the equation of motion of a bra: 


—i h 


dt 




(2.27) 


2 If necessary, we expand f(x) about some point xq / 0, i.e., in powers of x — xo, so 
we don’t need to worry that the series about the origin may not converge for all x. 

3 Beginners sometimes interpret the TDSE as stating that H = \hd/dt. This is as 
unhelpful as interpreting f = ma as a definition of f . For Newton’s equation to be useful 
it has to be supplemented by a description of the forces acting on the particle. Similarly, 
the TDSE is useful only when we have another expression for H. 
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where we have used the fact that H is Hermitian, so H' = H. The great 
importance of the Hamiltonian operator is due to its appearance in the tdse, 
which must be satisfied by the ket of any system. We shall see below in 
several concrete examples that the tdse, which we have not attempted to 
motivate physically, generates familiar motions in circumstances that permit 
classical mechanics to be used. 

One perhaps surprising aspect of the tdse we can justify straight away: 
while Newton’s second law is a second-order differential equation, the tdse 
is first-order. Since it is first order, the boundary data at t = 0 required to 
solve for | ip, t) at t > 0 comprise the ket \ip, 0). If the equation were second- 
order in time, like Newton’s law, the required boundary data would include 
d\ip)/dt. But \ip,0) by hypothesis constitutes a complete set of amplitudes; 
it embodies everything we know about the current state of the system. If 
mathematics required us to know something about the system in addition to 
|i/>,0), then either \ip) would not constitute a complete set of amplitudes, or 
physics could offer no hope of predicting the future, and it would be time to 
take up biology or accountancy, or whatever. 

The tdse tells us that states of well-defined energy evolve in time in an 
exceptionally simple way 


= H \ En) = En \E n ), (2.28) 

ot 

which implies that 

\E n ,t) = \E n ,0)e~ iE " t / h . (2.29) 

That is, the passage of time simply changes the phase of the ket at a rate 
E n /h. 

We can use this result to calculate the time evolution of an arbitrary 
state l^). In the energy representation the state is 


| ip,t) = y^ i a„{t)\E n ,t). (2.30) 

n 


Substituting this expansion into the tdse (2.26) we find 

= Y, m ( h n\E n ) + an^^j =^a n tf|£ n }, (2.31) 

n ' ' n 

where a dot denotes differentiation with respect to time. The right side 
cancels with the second term in the middle, so we have a n = 0. Since the a n 
are constant, on eliminating \E n ,t) between equations (2.29) and (2.30), we 
find that the evolution of \ip) is simply given by 

\ip, t>=£ a n e~ iEnt,h \E„, 0). (2.32) 

n 

We shall use this result time and again. 

States of well-defined energy are unphysical and never occur in Nature 
because they are incapable of changing in any way, and hence it is impossible 
to get a system into such a state. But they play an extremely important role 
in quantum mechanics because they provide the almost trivial solution (2.32) 
to the governing equation of the theory, (2.26). Given the central role of these 
states, we spend much time solving their defining equation 

H\E n )=E n \E n ), (2.33) 

which is known as the time-independent Schrodinger equation, or 
TISE for short. 
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2.2.1 Evolution of expectation values 

We have seen that {ip\Q\ip) is the expectation value of the observable Q 
when the system is in the state \ip), and that expectation values provide a 
natural connection to classical physics, which is about situations in which the 
result of a measurement is almost certain to lie very close to the quantum- 
mechanical expectation value. We can use the tdse to determine the rate 
of change of this expectation value: 

= — {ip\HQ\ip) + ih(ip\^-\ip) + (ip\QH\ip) 
at, at (2.34) 

= (^\[Q,H]\ip)+ih(ip\^\ip), 

where we have used both the tdse (2.26) and its Hermitian adjoint (2.27) 
and the square bracket denotes a commutator - see (2.21). Usually operators 
are independent of time (i.e., dQ/dt = 0), and then the rate of change of an 
expectation value is the expectation value of the operator —i[Q, H]/h. This 
important result is known as Ehrenfest’s theorem. 

If a time-independent operator Q happens to commute with the Hamil¬ 
tonian, that is if [ Q , H] = 0, then for any state | ip) the expectation value 
of Q is constant in time, or a conserved quantity. Moreover, in these 
circumstances Q 2 also commutes with H, so (ip\(AQ) 2 \ip) = ( Q 2 ) — ( Q ) 2 
is also constant. If initially ip is a state of well-defined Q, i.e., | ip) = \qi) 
for some i, then ((A Q) 2 ) = 0 at all times. Hence, whenever [Q,H] = 0, 
a state of well defined Q evolves into another such state, so the value of Q 
can be known precisely at all times. The value q j is then said to be a good 
quantum number. We always need to label states in some way. The label 
should be something that can be checked at any time and is not constantly 
changing. Good quantum numbers have precisely these properties, so they 
are much employed as labels of states. 

If the system is in a state of well defined energy, the expectation value 
of any time-independent operator is time-independent, even if the operator 
does not commute with H. This is true because in these circumstances 
equation (2.34) becomes 


ih^-(E\Q\E) = (E\(QH - HQ)\E) = (E — E){E\Q\E) = 0, (2.35) 

at 

where we have used the equation H\E) = E\E) and its Hermitian adjoint. 
In view of this property of having constant expectation values of all time- 
independent operators, states of well defined energy are called stationary 
states. 

Since H inevitably commutes with itself, equation (2.34) gives for the 
rate of change of the expectation of the energy 


d(E) _ (dH\ 
dt \ dt / ' 


(2.36) 


In particular (E) is constant if the Hamiltonian is time-independent. This 
is a statement of the principle of the conservation of energy since time- 
dependence of the Hamiltonian arises only when some external force is work¬ 
ing on the system. For example, a particle that is gyrating in a time- 
dependent magnetic field has a time-dependent Hamiltonian because work 
is being done either on or by the currents that generate the field. 
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2.3 The position representation 

If the system consists of a single particle that can move in only one dimension, 
the amplitudes il>(x) to find the particle at x for x in (—oo, oo) constitute a 
complete set of amplitudes. By analogy with equation (1.29) we have 4 

/ OO 

dxip{x)\x). (2.37) 

-OO 

Here an integral replaces the sum because the spectrum of possible values 
of x is continuous rather than discrete. Our basis kets are the states |a;) in 
which the particle is definitely at x. By analogy with equation (1.30) we 
have 

ip(x) = {x\ip). (2.38) 

Notice that both sides of this equation are complex numbers that depend on 
the variable x, that is, they are complex-valued functions of x. For historical 
reasons, the function i/j(x) is called the wavefunction of the particle. By 
the usual rule (1-27) for complex conjugation of a bra-ket we have 

i)*{x) = ( i/j\x ). (2.39) 

The analogue for the kets \x) of the orthogonality relation (1.22) is 

{x'\x)=5{x — x'), (2.40) 

where the Dirac delta function S(x — x') is zero for x ^ x' because when 
the particle is at x, it has zero amplitude to be at a different location x'. 
We get insight into the value of 6(x — x') for x = x' by multiplying equation 
(2.37) through by {x'\ and using equation (2.38) to eliminate {x'\rp)\ 


(Z'M = #*') = / dx 

= /d* *(*)«(*-s'). 


(2.41) 


Since S(x — x') is zero for r ^ s', we can replace ip(x) in the integrand by 
ip(x') and then take this number outside the integral sign and cancel it with 
the if>(x') on the left hand side. What remains is the equation 


1 = 



-*')• 


(2.42) 


Thus there is unit area under the graph of d(a;), which is remarkable, given 
that the function vanishes for x ^ 0! Although the name of d(a;) includes 
the word ‘function’, this object is not really a function because we cannot 
assign it a value at the origin. It is best considered to be the limit of a series 
of functions that all have unit area under their graphs but become more and 
more sharply peaked around the origin (see Figure 2.1). 

The analogue of equation (1.31) is 

J dx\i/j{x)\ 2 = 1, (2.43) 

which expresses the physical requirement that there is unit probability of 
finding the particle at some value of x. 

The analogue of equation (2.2) is 


I = 


da; |x)(x|. 


(2.44) 


4 The analogy would be clearer if we wrote a(x) for 0(x), but for historical reasons 
the letter i/> is hard to avoid in this context. 
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Figure 2.1 A series of Gaussians of unit area. The Dirac delta function is the limit of 
this series of functions as the dispersion tends to zero. 


It is instructive to check that the operator that is defined by the right side 
of this equation really is the identity operator. Applying the operator to an 
arbitrary state | ip) we find 

I\ip) = J dtu |ai)(2.45) 

By equations (2.37) and (2.38) the expression on the right of this equation 
is \4>), so I is indeed the identity operator. 

When we multiply (2.45) by (cf>\ on the left, we obtain an important 
formula 

(4>\ip) = J dx {(f>\x) {x\ip) = J dx(f>*(x)ip(x), (2.46) 

where the second equality uses equations (2.38) and (2.39). Many practical 
problems reduce to the evaluation of an amplitude such as The expres¬ 

sion on the right of equation (2.46) is a well defined integral that evaluates 
to the desired number. 

By analogy with equation (2.5), the position operator is 

x = f dxx\x){x\. (2.47) 


After applying x to a ket \i/j) we have a ket |</>) = x\ip) whose wavefunction 
(f){x') = {x'\x\4>) is 


<t>&) 


j dxx(x'\x)(x\*p) 


/dxxjp-x>(x)=,V(x'), 


(2.48) 


where we have used equations (2.38) and (2.40). Equation (2.48) states that 
the operator x simply multiplies the wavefunction ip(x) by its argument. 

In the position representation, operators turn functions of x into other 
functions of x. An easy way of making a new function out of an old one is 
to differentiate it. So consider the operator p that is defined by 

{x\p\ip) = (pip){x) = (2.49) 

In Box 2.2 we show that the factor i ensures that p is a Hermitian operator. 
The factor h ensures that p has the dimensions of momentum: 5 we will find 

5 Planck’s constant h = 2tt h has dimensions of distance x momentum, or, equivalently, 
energy X time, or, most simply, angular momentum. 
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Box 2.2: Proof that p is Hermitian 

We have to show that for any states \<j>) and \4>), (V’|p|</>} = ((<t>\p\ip))* ■ We 
use equation (2.49) to write the left side of this equation in the position 
representation: 

WI0) =~ih J d xip*(x)^. 

Integrating by parts this becomes 

W|</>) = -i n ^[V’>]!° 00 - J dx ^^^j ■ 

We assume that all wavefunctions vanish at spatial infinity, so the term 
in square brackets vanishes, and 

(^\p\4>) = ift J dx^x)^- = (Wl^))*- 


that p is the momentum operator. In Newtonian physics the momentum 
of a particle of mass m and velocity x is mx, so let’s use equation (2.34) to 
calculate d ( x) /d t and see whether it is ( p ) /to. 


2.3.1 Hamiltonian of a particle 

To calculate any time derivatives in quantum mechanics we need to know 
what the Hamiltonian operator H of our system is because H appears in the 
tdse (2.26). Equation (2.5) defines H in the energy representation, but not 
how to write H in the position representation. We are going to have to make 
an informed guess and justify our guess later. 

The Newtonian expression for the energy of a particle is 

v 2 

E = ±mx 2 + V = + V, (2.50) 

z 2 TO 

where V (x) is the particle’s potential energy. So we guess that the Hamilto¬ 
nian of a particle is 

H = |l + F(i), (2.51, 

where the square of p means the act of operating with p twice ( p 2 = pp). The 
meaning of V(x) is given by equation (2.20) with V and x substituted for / 
and R. Working from that equation in close analogy with the calculation in 
equation (2.48) demonstrates that in the position representation the operator 
V(x) acts on a wavefunction ip{ x ) simply by multiplying ip by V(x). That 
is, (x|V(x)|^) = V(x)ip{x). 

Now that we have guessed that H is given by equation (2.51), the next 
step in the calculation of the rate of change of (x) is to evaluate the commu¬ 
tator of x and H. Making use of equations (2.22) we find 



In the last equality we have used the fact that [x, V (x)] = 0, which follows 
because both x and V (x) act by multiplication, and ordinary multiplication 
is a commutative operation. We now have to determine the value of the 
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commutator [x, p]. We return to the definition (2.49) of p and calculate the 
wavefunction produced by applying [x,p] to an arbitrary state | ip) 

( x \%p\W) = {x\(xp -px)\*p) = -m (x^ - (2 53) 

= ih(x\ip). 

Since this equation holds for any | ip), we have the operator equation 


[x,p\ = in. 


(2.54) 


This key result, that the commutator of x with p is the constant i h, is called 
a canonical commutation relation. 6 Two observables whose commutator 
is PiU are said to be canonically conjugate to one another, or conjugate 
observables. 

Finally we have the hoped-for relation between p and x: substituting 
equations (2.53) and (2.54) into equation (2.34) we have 

= T7<V#IV’) = 

at at rt ri tti ^2 55^ 


This result makes it highly plausible that p is indeed the momentum operator. 

A calculation of the rate of change of (p) will increase the plausibility 
still further. Again working from (2.34) and using (2.51) we have 


^ = 4 ®, * 1 > = 4 <&P,n- ( 2 . 56 ) 


Since [p, x\ = —in is just a number, equation (2.25) for the commutator of 
one operator with a function of another operator can be used to evaluate 
[p, V (£)] • We then have 


d(p) /dV\ 

dt \ dx / 


(2.57) 


That is, the expectation of the rate of change of the momentum is equal 
to the expectation of the force on the particle. Thus we have recovered 
Newton’s second law from the tdse. This achievement gives us confidence 
that (2.51) is the correct expression for H. 


2.3.2 Wavefunction for well defined momentum 

From the discussion below equation (2.11) we know that the state \p) in which 
a measurement of the momentum will certainly yield the value p has to be 
an eigenstate of p. We find the wavefunction u p {x) = (x\p) of this important 
state by using equation (2.49) to write the defining equation p\p) = p\p) in 
the position representation: 


F)qi 

(x\p\p) = -ift-^y =P{x\p) =pu p {x). 


(2.58) 


The solution of this differential equation is 

Up(x) = Ae ipx/n . (2.59) 


6 The name that derives from ‘canonical coordinates’ in Hamilton’s formulation of 
classical mechanics. 
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Box 2.3: Gaussian integrals 

Consider the integral 

/ OO 

dxe-^ x2+ax \ ( 1 ) 

-OO 

where a and b are constants. We observe that b 2 x 2 +ax = (bx + a/2b ) 2 — 
a 2 /4b 2 . Thus we may write I = e a / 4b b~ l J d ze~ z , where z = bx+a/2b. 
The integral is equal to y/n. Hence we have the very useful result 

f°° dxe~ {b2x2+ax) = ^ e“ 2/4fc2 . (2) 

J — OO ” 


Hence the wavefunction of a particle of well defined momentum is a plane 
wave with wavelength A = 2ir/k = h/y/2mE, where m is the particle’s mass 
and E its kinetic energy; A is called the particle’s de Broglie wavelength. 7 

If we try to choose the constant A in (2.59) to ensure that u p satisfies 
the usual normalisation condition (2.43), we will fail because the integral 
over all x of \e lpx / n \ 2 = 1 is undefined. Instead we choose A as follows. By 
analogy with (2.40) we require {p'\p) = 8{jp — p'). When we use (2.44) to 
insert an identity operator into this expression, it becomes 

S(p — p') = J dx (p'\x)(x\p) = \A\ 2 J dxe l( ' p ~ p ^ x ! n = 2ttTi\A\ 2 8{p — p r ), 

(2.60) 

where we have used equation (C.12) to evaluate the integral. Thus \A\ 2 = 
h~ l , where h = 2nh is Planck’s constant, and the correctly normalised wave- 
function of a particle of momentum p is 

Up (x) = (x\p) = ^e ipx / n . (2.61) 


The uncertainty principle It follows from (2.61) that the position of a 
particle that has well defined momentum is maximally uncertain: all values of 
x are equally probable because \u p \ 2 is independent of x. This phenomenon 
is said to be a consequence of the uncertainty principle, 8 namely that 
when an observable has a well-defined value, all values of the canonically 
conjugate observable are equally probable. 

We can gain useful insight into the workings of the uncertainty principle 
by calculating the variance in momentum measurements for states in which 
measurements of position are subject to varying degrees of uncertainty. For 
definiteness we take the probability density \i/{x)\ 2 to be a Gaussian distri¬ 
bution of dispersion a. So we write 


ip(x) 


1 e -z 2 /4<x 2 

(27TC7 2 ) 1 / 4 


(2.62) 


With equations (2.46) and (2.61) we find that in this state the amplitude to 
measure momentum p is 


W> = / dxu;(xMx) = ^_ (2 J /4 J dxe- ipx / n e~ x2 ^ 2 . (2.63) 


7 Louis de Broglie (1892-1987) was the second son of the Due de Broglie. In 1924 his 
PhD thesis introduced the concept of matter waves, by considering relativistic invariance 
of phase. For this work he won the 1929 Nobel prize for physics. In later years he struggled 
to find a causal rather than probabilistic interpretation of quantum mechanics. 

8 First stated by Werner Heisenberg, Z. Phys., 43, 172 (1927), and consequently often 
called ‘Heisenberg’s uncertainty principle’. 
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Box 2.3 explains how integrals of this type are evaluated. Setting a = i p/Ti 
and b = (2<r) -1 in equation (2) of Box 2.3 we find 


{P IV>) 


2 a V n c -cr 2 p 2 /h 2 

\/Ti.(2'ko 2 ) 1 / 4 


1 -* 2 p 2 m 2 . 

(27r?i 2 /4(j 2 ) 1 / 4 


(2.64) 


The probability density |(p|^)| 2 is a Gaussian centred on zero with a disper¬ 
sion Op in p that equals Ti/2o. Thus, the more sharply peaked the particle’s 
probability distribution is in x , the broader the distribution is in p. The 
product of the dispersions in x and p is always \fi: o p o = \fi. 

This trade-off between the uncertainties in x and p arises because when 
we expand 1 if>) in eigenkets of p , localisation of the probability amplitude 
il>(x) is caused by interference between states of different momenta: in the 
position representation, these states are plane waves of wavelength h/p that 
have the same amplitude everywhere, and interference between waves of very 
different wavelengths is required if the region of constructive interference is 
to be strongly confined. 


2.3.3 Dynamics of a free particle 

We now consider the motion of a free particle - one that is subject to no forces 
so we can drop the potential term in the Hamiltonian (2.51). Consequently, 
the Hamiltonian of a free particle, 



(2.65) 


is a function of p alone, so its eigenkets will be the eigenkets (2.61) of p. 
By expressing any ket \tp) as a linear combination of these eigenkets, and 
using the basic time-evolution equation (2.32), we can follow the motion of 
a particle from the initial state \ip). Wc illustrate this procedure with the 
case in which «/> corresponds to the particle being approximately at the origin 
with momentum near some value po■ Equation (2.64) gives (p\ip) for the case 
in which po vanishes. The amplitude distribution that we require is 


(pIiM) 


_ _ _ e -° 2 (p-po) 2 /n 2 

(2ttTi 2 /Ao 2 ) 1 / 4 


( 2 . 66 ) 


We can now use (2.32) to obtain the wavefunction t, units of time later 


<*hM>= / dpix^ip^e-^/ 2 ™* 


dpe 11 


(2.67) 


V / 7z(27r7l 2 /4CT 2 ) 1 / 4 


Evaluating the integral in this expression involves some tiresome algebra 
- you can find the details in Box 2.4 if you are interested. We want the 
probability density at time t of finding the particle at x , which is the mod- 
square of equation (2.67). From the last equation of Box 2.4 we have 


\(x\i/j,t)\ 2 


V^n 2 \b \ 2 exp 


-{x 


-p 0 t/m) 2 o 2 

2Ti A \b\ 4 


( 2 . 68 ) 


This is a Gaussian distribution whose centre moves with the velocity Po/m 
associated with the most probable momentum in the initial data (2.66). The 
square of the Gaussian’s dispersion is 


fit 


o 2 (t)=o 2 + 


2 mo 


2 


(2.69) 
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Box 2.4: Evaluating the integral in equation (2.67) 


The integral is of the form discussed in Box 2.3. To clean it up we 
replace the p 2 in the third exponential with (p — po) 2 + 2poP — Po and 
gather together all three exponents: 


(x\ip,t) =- 




mh 


/ 4a 2 ) 1 / 4 
x /d P exp{|(: 
In Box 2.3 we now set 

Pot 


a = — x — 


and conclude that 

p iplt/2mh 

(x\ip,t) = 


\/h.(2irh 2 /4<j 2 ) - 1 / 4 


exp 



Pot\ 1 V /7r c -fa- Pn t/m) 2 /4h 2 b 2 

\ h \ m J / b 


This is a complicated result because b is a complex number, but its mod- 
square, equation (2.68), is relatively simple. 


We saw above that in the initial data the uncertainty in p is ~ a p = Ti/2tj, 
which translates to an uncertainty in velocity A„ ~ h/2ma. After time t 
this uncertainty should lead to an additional uncertainty in position A x = 
A v t ~ fit/2mo in perfect agreement with equation (2.69). 

These results complete the demonstration that the identification of the 
operator p defined by equation (2.49) with the momentum operator, together 
with the Hamiltonian (2.51), enable us to recover as much of Newtonian me¬ 
chanics as we expect to continue valid outside the classical regime. The idea 
that in an appropriate limit the predictions of quantum mechanics should 
agree with classical mechanics is often called the correspondence prin¬ 
ciple. The discipline of checking that one’s calculations comply with the 
correspondence principle is useful in several ways: (i) it provides a check on 
the calculations, helping one to locate missing factors of i or incorrect signs, 
(ii) it deepens one’s understanding of classical physics, and (iii) it draws at¬ 
tention to novel predictions of quantum mechanics that have no counterparts 
in classical mechanics. 

In the process of checking the correspondence principle for a free particle 
we have stumbled on a new principle, the uncertainty principle, which implies 
that the more tightly constrained the value of one observable is, the more 
uncertain the value of the conjugate variable must be. Notice that these 
uncertainties do not arise from measurement errors: we have assumed that 
x and p can be measured exactly. The uncertainties we have discussed are 
inherent in the situation and can only be increased by deficiencies in the 
measurement process. 

Our calculations have also shown how far-reaching the principle of quan¬ 
tum interference is: equation (2.67), upon which our understanding of the 
dynamics of a free particle rests, expresses the amplitude for the particle to 
be found at x at time t as an integral over momenta of the amplitude to travel 
at momentum p. It is through interference between the infinite number of 
contributing amplitudes that classically recognisable dynamics is recovered. 
Had we mod-squared the amplitudes before adding them, as classical prob¬ 
ability theory would suggest, we would have obtained entirely unphysical 
results. 
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2.3.4 Back to two-slit interference 

When we discussed the two-slit interference experiment in §1.2.1, we stated 
without proof that 0i — 02 oc x, where 4>i(x) is the phase of the amplitude 
Ai(x) for an electron to arrive at the point x on the screen P after passing 
through the slit S,. We can now justify this assertion and derive the constant 
of proportionality. Once the constant has been determined, it is possible to 
assess the feasibility of the experiment from a practical point of view. 

We assume that the quantum state of an electron as it emerges from 
the electron gun can be approximated by a state of well defined momentum 
|p). So the wavefunction between the gun and the screen with the slits is a 
plane wave of wavelength A = h/p. As an electron passes through a slit we 
assume that it is deflected slightly but retains its former kinetic energy. So we 
approximate its wavefunction after passing through the slit by a wave that is 
no longer plane, but still has wavelength A. Hence the phase of this wave at 
position x on the screen P will be the phase at the slit plus 2ttD/X = pD/h, 
where D(x) is the distance from x to the slit. By Pythagoras’s theorem 

D = \/L 2 + (x±s) 2 , (2.70) 


where L is the distance between the screen with the slits and P, 2s is the 
distance between the slits, and the plus sign applies for one slit and the minus 
sign to the other. We assume that both x and s are much smaller than L 
so the square root can be expanded by the binomial theorem. We then find 
that the difference of the phases is 


0i ~ 02 


2psx 

hL 


(2.71) 


The distance X between the dark bands on P is the value of x for which the 
left side becomes 27t, so 



(2.72) 


Let’s put some numbers into this formula. Since h = 6.63 x 10 -34 J s is very 
small, there is a danger that X will come out too small to produce observable 
bands. Therefore we choose L fairly large and both p and s small. Suppose 
we adopt 1 m for L and 1 pm for s. From the Hamiltonian (2.65) we have p = 
yj2mE. A reasonable energy for the electrons is E = 100 eV = 1.6 x 10~ 17 J, 
which yields p = 5.5 x 10~ 24 Ns, and X = 0.057 mm. Hence there should be 
no difficulty observing a sinusoidal pattern that has this period. 

What do the numbers look like for bullets? On a firing range we can 
probably stretch L to 1000 m. The distance between the slits clearly has 
to be larger than the diameter of a bullet, so we take s = 1 cm. A bullet 
weighs ~ lOgrn and travels at ~ 300ms -1 . Equation (2.72) now yields 
X ~ 10 _29 m. So it is not surprising that fire-arms manufacturers find 
classical mechanics entirely satisfactory. 


2.3.5 Generalisation to three dimensions 

Real particles move in three dimensions rather than one. Fortunately, the 
generalisation to three dimensions of what we have done in one dimension is 
straightforward. 

The x, y and z coordinates of a particle are three distinct observables. 
Their operators commute with one another: 

[xi,Xj\= 0. (2.73) 

Since these are commuting observables, there is a complete set of mutual 
eigenkets, {| x)}. We can express any state of the system, | tf>), as a linear 
combination of these kets: 

10) = J d 3 x(x|V>) |x) = J d 3 x^(x) |x), 


(2.74) 
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where the wavefunction ip(x) is now a function of three variables, and the 
integral is over all space. 

The x, y and z components of the particle’s momentum p commute with 
one another: 

\puPj) = 0- (2.75) 

In the position representation, these operators are represented by partial 
derivatives with respect to their respective coordinates 


Pi = —iH 


dxi 


so p = — iftV . 


(2.76) 


The momenta commute with all operators except their conjugate coordinate, 
so the canonical commutation relations are 


[xi,pj] — ihSij. (2.77) 

In §4.2 we will understand the origin of the factor 6ij. Since the three mo¬ 
mentum operators commute with one another, there is a complete set of 
mutual eigenstates. Analogously to equation (2.61), the wavefunction of the 
state with well defined momentum p is 

(*|P> = jk e " P/h - (2 ' 78) 

In the position representation the tdse of a particle of mass m that 
moves in a potential V (x) reads 

= = ^ + ( X I F ( X M‘ ( 2 - 79 ) 

Now (x.\p 2 \ip) = — h 2 \/ 2 (x.\%p) and (x| V(x.)\ip) = I7(x)(x| tp). Hence using the 
definition ip(x) = (x|^), the tdse becomes 

iS w = -£ v ^ +v ' (xW ' (2 - 80) 

Probability current Max Born 9 first suggested that the mod-square of 
a particle’s wavefunction, 

p(x,t) = \ip(x,t)\ 2 (2.81) 

is the probability density of finding the particle near x at time t. Since 
the particle is certain to be found somewhere , this interpretation implies 
that at any time f d 3 x |^(x, t )| 2 = 1. It is not self-evident that this physical 
requirement is satisfied when the wavefunction evolves according to the TDSE 
(2.80). We now show that it is in fact satisfied. 

We multiple the tdse (2.80) by ip* and subtract from it the result of 
multiplying the complex conjugate of (2.80) by ip. Then the terms involving 
the potential V (x) cancel and we are left with 

m { rd i + *°-w) = is®) 

The left side of this equation is a multiple of the time derivative of p = 
The right side can be expressed as a multiple of the divergence of the 

probability current 


J (x)E^-(W-fV^). (2.83) 

9 For this insight Born won the 1954 Nobel Price for physics. In fact the text of the 
key paper (Born, M., Z. Physik, 37 863 (1926)) argues that is the probability density, 
but a note in proof says “On more careful consideration, the probability is proportional 
to the square of •0”. 
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That is, equation (2.82) can be written 

f = (2.84, 

In fluid mechanics this equation with J = px expresses the conservation of 
mass as a fluid of density p flows with velocity v(x). In quantum mechanics 
it expresses conservation of probability. To show that this is so, we simply 
integrate both sides of equation (2.84) through a volume V. Then we obtain 

4- [ d 3 xp= [ d 3 x^ = - [ d 3 xV- J = - <f d 2 S • J, (2.85) 
(it J v J v ot J v J dv 

where the last equality uses the divergence theorem and dV denotes the 
boundary of V. Equation (2.85) states that the rate of increase of the proba¬ 
bility P = f v d 3 x p of finding the particle in V is equal to minus the integral 
over the volume’s bounding surface of the probability flux out of the volume. 
If V encompasses all space, ip, and therefore J, will vanish on the boundary, 
so f v d 3 xp will be constant. 

We can gain valuable insight into the meaning of a wavefunction by 
explicitly breaking ip into its modulus and phase: 

V>(x) = S(x)e i<Kx \ (2.86) 


where S and </> are real. Substituting this expression into the definition (2.83) 
of J, we find 

J = -^-(SVS - iS 2 V<p - SVS - iS 2 Vcp) = — S 2 Vcp. (2.87) 
2 m m 

Since S 2 = \ip\ 2 = p, the velocity v that is defined by setting J = px is 


v = 


hVcp 

m 


( 2 . 88 ) 


Thus the gradient of the phase of the wavefunction encodes the velocity at 
which the probability fluid flows. In classical physics, this is the particle’s 
velocity. The phase of the wavefunction (2.78) of a particle of well-defined 
momentum is <p(x.) = x • p/h, so in this special case v = p/m as in classical 
physics. Equation (2.88) extends the connection between velocity and the 
gradient of phase to general wavefunctions. 

The virial theorem We illustrate the use of the canonical commutations 
relations equations (2.73), (2.75) and (2.77) by deriving a relation between 
the kinetic and potential energies of a particle that is in a stationary state. 
In §2.2.1 we showed that all expectation values are time-independent when 
a system is in a stationary state. We apply this result to the operator x • p 


° = iftl<x.p> = <£?| x-p,|-+V(x) | E) 
= ^(-Ellx- P,P 2 ]\E) + (£|[x.p,V(x)]|25>. 


The first commutator can be expanded thus 


(2.89) 


[x • p,p 2 ] = ^[xjPjiPl] = Y^&j’PklPj = 2[h Pk$okPj = 2ihp 2 . (2.90) 

jk jk jk 


In the position representation the second commutator is simply 


[x • p, V(x)] = -iftx-VV(x). 


(2.91) 
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When we put these results back into (2.89) and rearrange, we obtain the 

virial theorem 

2(E\^-\E) = (E\(x-\7V)\E). (2.92) 

In important applications the potential is proportional to some power of 
distance from the origin: V(x) = C|x| Q . Then, because V|x| = x/|x|, the 
operator on the right is x • VU = a(7|x| Q = aV and the virial theorem 
becomes 

2(E\^-\E)=a(E\V\E). (2.93) 

So twice the kinetic energy is equal to a times the potential energy. For 
example, for a harmonic oscillator a = 2, so kinetic and potential energies 
are equal. The other important example is motion in an inverse-square force 
field, such as the electrostatic field of an atomic nucleus. In this case a = — 1, 
so twice the kinetic energy plus the potential energy vanishes. Equivalently, 
the kinetic energy is equal in magnitude but opposite in sign to the total 
energy. 

Problems 

2.1 How is a wave-function ijj(x) written in Dirac’s notation? What’s the 
physical significance of the complex number ip(x) for given xl 

2.2 Let Q be an operator. Under what circumstances is the complex num¬ 
ber (a\Q\b) equal to the complex number ((6|Q|a))* for any states |a) and 

w 

2.3 Let Q be the operator of an observable and let | if>) be the state of our 
system. 

a. What are the physical interpretations of {ip\Q\tp) and |(g n |V ; }| 2 i where 
| q n ) is the n th eigenket of the observable Q and q n is the corresponding 
eigenvalue? 

b. What is the operator \Qn){Qn |> where the sum is over all eigenkets 
of Q ? What is the operator J2 n <?n|9n)(<?n|? 

c. If u n (x) is the wavefunction of the state | q n ), write dow an integral that 
evaluates to {q n \ij)). 

2.4 What does it mean to say that two operators commute? What is the 
significance of two observables having mutually commuting operators? 

Given that the commutator [P, Q\ ^ 0 for some observables P and Q, 
does it follow that for all \ip) ^ 0 we have [P, Q]\ip) ^ 0? 

2.5 Let ip(x,t) be the correctly normalised wavefunction of a particle of 
mass to and potential energy V(x). Write down expressions for the expec¬ 
tation values of (a) x ; (b) x 2 ; (c) the momentum p x ; (d) p 2 ; (e) the energy. 

What is the probability that the particle will be found in the interval 

(xi,x 2 ) r ! 

2.6 Write down the time-independent (tise) and the time-dependent (tdse) 
Schrodinger equations. Is it necessary for the wavefunction of a system to 
satisfy the TDSE? Under what circumstances does the wavefunction of a 
system satisfy the TISE? 

2.7 Why is the tdse first-order in time, rather than second-order like New¬ 
ton’s equations of motion? 

2.8 A particle is confined in a potential well such that its allowed energies 
are E n = n 2 8 , where n = 1,2,... is an integer and £ a positive constant. 

The corresponding energy eigenstates are 11), |2), ..., |n),_ At t = 0 the 

particle is in the state 


|^( 0 )) = 0 . 211 ) + 0 . 3 | 2 ) + 0 . 4 | 3 ) + 0 . 843 | 4 ). 


( 2 . 94 ) 
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a. What is the probability, if the energy is measured at t = 0 of finding a 
number smaller than 6 £? 

b. What is the mean value and what is the rms deviation of the energy of 
the particle in the state |-0(O))? 

c. Calculate the state vector | tjj) at time t. Do the results found in (a) and 
(b) for time t remain valid for arbitrary time tl 

d. When the energy is measured it turns out to be 16£. After the mea¬ 
surement, what is the state of the system? What result is obtained if 
the energy is measured again? 

2.9 A system has a time-independent Hamiltonian that has spectrum { E n }. 
Prove that the probability Pk that a measurement of energy will yield the 
value Ek is is time-independent. Hint: you can do this either from Ehrenfest’s 
theorem, or by differentiating (Ek,t\il>) w.r.t. t and using the tdse. 

2.10 Let ip(x) be a properly normalised wavefunction and Q an opera¬ 
tor on wavefunctions. Let {g r } be the spectrum of Q and {u r {x)} be the 
corresponding correctly normalised eigenfunctions. Write down an expres¬ 
sion for the probability that a measurement of Q will yield the value q r . 
Show that J2 r P{q r \4>) = 1- Show further that the expectation of Q is 
(Q) = J^rQ^dx.^ 

2.11 Find the energy of neutron, electron and electromagnetic waves of 
wavelength 0.1 nrn. 

2.12 Neutrons are emitted from an atomic pile with a Maxwellian distribu¬ 
tion of velocities for temperature 400 K. Find the most probable de Broglie 
wavelength in the beam. 

2.13 A beam of neutrons with energy E runs horizontally into a crystal. 
The crystal transmits half the neutrons and deflects the other half vertically 
upwards. After climbing to height El these neutrons are deflected through 90° 
onto a horizontal path parallel to the originally transmitted beam. The two 
horizontal beams now move a distance L down the laboratory, one distance H 
above the other. After going distance L , the lower beam is deflected vertically 
upwards and is finally deflected into the path of the upper beam such that 
the two beams are co-spatial as they enter the detector. Given that particles 
in both the lower and upper beams are in states of well-defined momentum, 
show that the wavenumbers k, k' of the lower and upper beams are related 

In an actual experiment (R. Colella et ah, 1975, Phys. Rev. Let., 34, 1472) 
E = 0.042 eV and LH ~ 10 -3 m 2 (the actual geometry was slightly differ¬ 
ent). Determine the phase difference between the two beams at the detector. 
Sketch the intensity in the detector as a function of H. 

2.14 A particle moves in the potential V(x) and is known to have energy 
E n . (a) Can it have well defined momentum for some particular V(x)? (b) 
Can the particle simultaneously have well-defined energy and position? 

2.15 The states {11), 12)} form a complete orthonormal set of states for a 
two-state system. With respect to these basis states the operator o y has 
matrix 

(2.96) 

Could a be an observable? What are its eigenvalues and eigenvectors in the 
{11), 12)} basis? Determine the result of operating with <j y on the state 

W = 4(|1) - |2». (2.97) 




10 In the most elegant formulation of qantum mechanics, this last result is the basic 
postulate of the theory, and one derives other rules for the physical interpretation of the q n , 
a n etc. from it - see J. von Neumann, Mathematical Foundations of Quantum Mechanics. 
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2.16 A three-state system has a complete orthonormal set of states |1), |2), |3). 
With respect to this basis the operators H and B have matrices 


0 

0 \ 

/I 

0 

°\ 

-1 

0 

5 = 60 

0 

1 

0 

-1/ 

\o 

1 

0 / 


(2.98) 


where u> and b are real constants. 

a. Are H and B Hermitian? 

b. Write down the eigenvalues of H and find the eigenvalues of B. Solve for 
the eigenvectors of both H and B. Explain why neither matrix uniquely 
specifies its eigenvectors. 

c. Show that H and B commute. Give a basis of eigenvectors common to 
H and B. 

2.17 Given that A and B are Hermitian operators, show that i[A, B] is a 
Hermitian operator. 

2.18 Given a ordinary function f(x) and an operator R, the operator f(R) 
is defined to be 

f( R ) = S ^f(Ti)\ r i){ r i\, (2.99) 

i 

where are the eigenvalues of R and \ri) are the associated eigenkets. Show 
that when f(x) = x 2 this definition implies that f(R) = RR, that is, that 
operating with f(R) is equivalent to applying the operator R twice. What 
bearing does this result have in the meaning of e R ? 

2.19 Show that if there is a complete set of mutual eigenkets of the Hermi¬ 
tian operators A and B , then [A, B] = 0. Explain the physical significance 
of this result. 

2.20 Given that for any two operators (ABy = B^A\ show that 


(ABCD)^ = D t G t H t A t . (2.100) 


2.21 Prove for any four operators A, B, C, D that 

[ABC, D} = AB[C, D] + A[B, D]C + [A, D]BC. (2.101) 

Explain the similarity with the rule for differentiating a product. 

2.22 Show that for any three operators A, B and C, the Jacobi identity 
holds: 

[A, [ B , C\] + [B , \C, A]] + [C, [A, B]\ = 0. (2.102) 

2.23 Show that a classical harmonic oscillator satisfies the virial equation 
2(KE) = a(PE) and determine the relevant value of a. 

2.24 Given that the wavefunction is ip = Ae l ^ kz ~ utS) + Be~^ kz+Ut \ where 
A and B are constants, show that the probability current density is 

J = v(\A\ 2 - \B\ 2 ) z, (2.103) 

where v = Uk/m. Interpret the result physically. 
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Harmonic oscillators and magnetic 
fields 


Harmonic oscillators are of enormous importance for physics because most 
of condensed-matter physics and quantum electrodynamics centre on weakly 
perturbed harmonic oscillators. The reason harmonic oscillators are so com¬ 
mon is simple. The points of equilibrium of a particle that moves in a 
potential V(x) are points at which the force — dV/dx vanishes. When we 
place the origin of x at such a point, the Maclaurin expansion of V becomes 
V(x) = constant + \V"x 2 + 0(x 3 ), and the force on the particle becomes 
F = —V"x + 0(x 2 ). Consequently, for sufficiently small excursions from the 
point of equilibrium, the particle’s motion will be well approximated by a 
harmonic oscillator. 

Besides providing the background to a great many branches of physics, 
our analysis of a harmonic oscillator will introduce a technique that we will 
use twice more in our analysis of the hydrogen atom. As a bonus, we will find 
that our results for the harmonic oscillator enable us to solve another impor¬ 
tant, and apparently unrelated problem: the motion of a charged particle in 
a uniform magnetic field. 


3.1 Stationary states of a harmonic oscillator 

We can build a harmonic oscillator by placing a particle in a potential that 
increases quadratically with distance from the origin. Hence an appropriate 
Hamiltonian is given by equation (2.51) with V <x x 2 . 1 For later convenience 
we choose the constant of proportionality such that H becomes 

H =-^{p 2 + (mux) 2 }. (3.1) 

In §2.2 we saw that the dynamical evolution of a system follows immediately 
once we know the eigenvalues and eigenkets of H. So we now determine 
these quantities for the Hamiltonian (3.1). 

1 In the last chapter we distinguished the position and momentum operators from their 
eigenvalues with hats. Henceforth we drop the hats; the distinction between operator and 
eigenvalue should be clear from the context. 



38 


Chapter 3: Harmonic oscillators and magnetic fields 


We next introduce the dimensionless operator 

^ _ mux + ip 
VZmhuj 


(3.2a) 


This operator isn’t Hermitian. Bearing in mind that x and p are Hermitian, 
from the rules in Table 2.1 we see that its adjoint is 


At = m “ X - ip (3.2b) 

The product A 1 A is 

A^A = ——-—(mux — ip) (mux + ip) 

2m 1 S " H (3.3) 

= ^{(™) 2 + ™»[*.!>] + p 2 } = J ~ ~ b 

where we have used the canonical commutation relation (2.54). This equation 
can be rewritten H/{Tiu) = A^A+ so A is rather nearly the square root of 
the dimensionless Hamiltonian H/Tiu. If we calculate AA 1 in the same way, 
the only thing that changes is the sign in front of the commutator [x,p\, so 
we have 

AA* = + |. (3.4) 

ruo z 

Subtracting equation (3.4) from equation (3.3) we find that 

H f ,A] = -l. (3.5) 

We will find it useful to have evaluated the commutator of A t with the 
Hamiltonian. Since from equation (3.3) H = hu(A jl A + |), we can write 

[A\H] = hu[A\ A^A] = huA^[A\ A] = —TiuA\ (3.6) 

where we have exploited the rules of equations (2.22). 

We now multiply both sides of the defining relation of \E„), namely 
H\E n ) =E n \E n ), by At: 

A'E n \E n ) = A^H\E n ) = ( HA t + [A*,H])\E n ) = {H - hu)A*\E n ). (3.7) 

A slight rearrangement of this equation yields 

H(A'\E n )) = (E n + hu)(A* \E n )). (3.8) 

Provided |6) = A^\E n ) has non-zero length-squared, this shows that | b) is an 
eigenket of H with eigenvalue E n + hu. The length-square of |6) is 

\A ] \E n )\ 2 = (E n \AA\E n ) = (E n | (JL + ^ | E n ) = ^ + ±. (3-9) 

Now squeezing H between (E n \ and \E n ) we find with (3.1) that 

E n = {E n \H\E n ) = J—(\ p \E n )\ 2 + m 2 u 2 \x\E n )\ 2 ) >0. (3.10) 

zmuj 

Thus the energy eigenvalues are non-negative, so |A^|i? rl )| 2 > 0 and by 
repeated application of A 1 we can construct an infinite series of eigenstates 
with energy E n + khu for k = 0,1,... 

Similarly, we can show that provided A\E n ) has non-zero length-squared, 
it is an eigenket of H for energy E n — Tiu. Since we know that all eigenvalues 
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are non-negative, for some energy Eq, A\Eq) must vanish. Equating to zero 
the length-squared of this vector we obtain an equation for E 0 : 

°=IW | 2 = (£ol(|;-i)|Eo> = g-i. On) 

So Eq = ^hco and we have established that the eigenvalues of H are 


fiwx(|, ..., ...) that is E r = (r + \)Tiu. 


(3.12) 


The operators A 1 and A with which we have obtained these important 
results are respectively called creation and annihilation operators be¬ 
cause the first creates an excitation of the oscillator, and the second destroys 
one. In quantum field theory particles are interpreted as excitations of the 
vacuum and each particle species is associated with creation and annihilation 
operators that create and destroy particles of the given species. A and A' 
are also called ladder operators. 

We now examine the eigenkets of H. Let |r) denote the state of energy 
(r+ \)Tux. In this notation the lowest-energy state, or ground state, is | 0 > 
and its defining equation is H|0) = 0. From equation (3.2a) this equation 
reads 


0 = H|0) 


mu>x\t)) + ip|0 ) 
a/2 rnMo 


(3.13) 


We now go to the position representation by multiplying through by (x|. 
With equations (2.48) and (2.49) we find that the equation becomes 


1 f 

\Z2rnhuj V 


muix - 


dx 


(x|0) = 0. 


(3.14) 


This is a linear, first-order differential equation. Its integrating factor is 
exp(mu)x 2 /2h), so the correctly normalised wavefunction is 


= (3 - 15> 

Notice that this solution is unique, so the ground state is non-degenerate. 
It is a Gaussian function, so the probability distribution P(x) = |(ar|0)| 2 for 
the position of the particle that forms the oscillator is also a Gaussian: its 
dispersion is £. 

From equations (2.63) and (2.64) we see that the momentum distribution 
of the wavefunction (3.15) is 

P(p) = |(p|0)| 2 ae- 2 ^ 2 / ft2 , (3.16) 

which is a Gaussian with dispersion cr p = Ti/2£. By inserting x = £ and 
p = o p in the Hamiltonian (3.1) we obtain estimates of the typical kinetic 
and potential energies of the particle when it’s in its ground state. We find 
that both energies are ~ jTna. In fact one can straightforwardly show that 
H(£,o p ) is minimised subject to the constraint £o p > h/2 when £ and o p 
take the values that we have derived for the ground state (Problem 3.4). In 
other words, in its ground state the particle is as stationary and as close to 
the origin as the uncertainty principle permits; there is a conflict between the 
advantage energetically of being near the origin, and the energetic penalty 
that the uncertainty principle exacts for having a well defined position. 

Every system that has a confining potential exhibits an analogous zero- 
point motion. The energy tied up in this motion is called zero-point 
energy. Zero-point motion is probably the single most important prediction 
of quantum mechanics, for the material world is at every level profoundly 
influenced by this phenomenon. 
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We obtain the wavefunctions of excited states by applying powers of 
the differential operator A' to (a;|0). Equation (3.9) enables us to find the 
normalisation constant a in the equation \n + 1) = aA'\n)\ it implies that 
a 2 = n+ 1. the generalisation of equation (3.11) enables us to determine the 
number /3 in the equation |n — 1) = f3A\n), and we have finally 

\n + l) = -^=A\n) ; \n - l) = ±=A\n). (3.17) 

Vn + 1 \Jn 

It is useful to remember that the normalisation constant is always the square 
root of the largest value of n appearing in the equation. As a specific example 

W1> = ^("“ x -4) <a:|0> = (i- < ^) <x|0> (318) 

1 x -x 2 /U 2 

(2tt^) 1 /4 £ 


Whereas the ground-state wavefunction is an even function of x, the wave- 
function of the first excited state is an odd function because A^ is odd in x. 
Wavefunctions that are even in x are said to be of even parity, while those 
that are odd functions have odd parity. It is clear that this pattern will 
be repeated as we apply further powers of A^ to generate the other states 
of well-defined energy, so ( x\n ) is even parity if n is even, and odd parity 
otherwise. 

Notice that the operator N = A 1 A is Hermitian. By equations (3.17) 
N\n) = n\n), so its eigenvalue tells you the number of excitations the oscil¬ 
lator has. Hence N is called the number operator. 

Let’s use these results to find the mean-square displacement (n|a; 2 |n) 
when the oscillator is in its n th excited state. Adding equations (3.2) we 
express a; as a linear combination of A and Ai 


x = ] f^(A + A) = l(A + A), (3.19) 

where i is defined by (3.15), so 

(n|a: 2 |n) = £ 2 (n\(A +A^) 2 \n). (3.20) 

When we multiply out the bracket on the right, the only terms that con¬ 
tribute are the ones that involve equal numbers of As and A^s. Thus 

(n|x 2 |n) = f' 2 (n|(AA t +A t A)|n) =£ 2 {2n + l) =t 2 ^, (3.21) 

huj 

where we have used equations (3.17) and (3.12). If we use equation (3.15) to 
eliminate £, we obtain a formula that is valid in classical mechanics (Prob¬ 
lem 2.23). 
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3.2 Dynamics of oscillators 

By equations (2.29) and (3.12), the n th excited state of the harmonic oscil¬ 
lator evolves in time according to 

\n,t) = e- i(ra+1/2)aJt |n,0) (3.22) 

Consequently, no state oscillates at the oscillator’s classical frequency w. How 
do we reconcile this result with classical physics? 

We have seen that we make the link from quantum to classical physics 
by considering the expectation values of observables if classical physics 
applies, the measured value of any observable will lie close to the expectation 
value, so the latter provides an accurate description of what’s happening. 
Equation (2.35) tells us that when a system is in an energy eigenstate, the 
expectation value of any time-independent observable Q cannot depend on 
time. Equation (3.22) enables us to obtain this result from a different point of 
view by showing that when we form the expectation value ( Q ) = {ip\Q\i/)), the 
factor e~ lE,lt / h in the ket | ip,t) = e~ lEnt / h \E n ) cancels on the corresponding 
factor in \. Hence energy eigenstates are incapable of motion. 2 The 
system is capable of motion only if there are non-negligible amplitudes to 
measure more than one possible energy, or, equivalently, if none of the cq in 
the sum (2.32) has near unit modulus. 

Consideration of the motion of a harmonic oscillator will make this gen¬ 
eral point clearer. If the oscillator’s state is written 

\^t)=Y / a j e- iE ^ n \j), (3.23) 

then the expectation value of x is 

<*> = Y.<^ Ek - Ei)t/h mj) = J2 a *k a ^ i{k ~ j)Ut ( k \x\3). (3.24) 

jk jk 


We simplify this expression by using equation (3.19) to replace x with f(A + 
A^) and then using (3.17) to evaluate the matrix elements of A and A': 

( a ;)=^4a i e i ( fe -^(fc|(A + At)|i) 

■* k (3.25) 

= ^^2 a k a j el{k ~ 3)ut (Vj(k\j - 1 ) + VT+l{k\j + 1 ). 

jk 


Since ( k\j — 1) vanishes unless k = j — 1, it’s now easy to perform the sum 
over fc, leaving two terms to be summed over j. On account of the factor 
y/j we can restrict the first of these sums to j > 0 , and in the second sum 
we replace j by j' = j + 1 and then replace the symbol j' by j so we can 
combine the two sums. After these operations we have 

(x) = + a*cij- 1 e lut ) 

( 3 ' 26a ) 

= 2, Xj COS (cot + (f)j), 

3 

where the real numbers Xj and (f>j are defined by 

2 y/jea* j a j -i=X j e i ^. (3.26b) 

Thus (x) oscillates sinusoidally at the classical frequency oj regardless of the 
amplitudes cij. Thus we have recovered the classical result that the frequency 

2 If we consider that t is the variable canonically conjugate to energy, this fact becomes 
a manifestation of the uncertainty principle. 
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Figure 3.1 The potential energy V(x ) of an anharmonic oscillator (full curve) and V(x) 
for the harmonic oscillator obtained by restricting the potential to the first two terms in 
its Maclaurin expansion (dashed curve). 

at which a harmonic oscillator oscillates is independent of amplitude and 
equal to yjk/m , where k is the oscillator’s spring constant. 

In the classical regime, the only non-negligible amplitudes aj have in¬ 
dices j that cluster around some large number n. Consequently, a measure¬ 
ment of the energy is guaranteed to yield a value that lies close to E = E n , 
and from equation (3.21) it follows that the mean value of x 2 will lie close 
to x 2 = 2(, 2 E n /(hui). Classically, the time average of x 2 is proportional to 
the average potential energy, which is just half the total energy. Hence, av¬ 
eraging the Hamiltonian (3.1) we conclude that classically x 2 = E/(imo 2 ), 
in precise agrees with the quantum-mechanical result. The correspondence 
principle requires the classical and quantum-mechanical values of x 2 to agree 
for large n. That they agree even for small n is a coincidence. 


3.2.1 Anharmonic oscillators 

The Taylor series of the potential energy V(x ) of a harmonic oscillator is 
very special: it contains precisely one non-trivial term, that proportional to 
x 2 . Real oscillators have potential-energy functions that invariably deviate 
from this ideal to some degree. The deviation is generally in the sense that 
V (x) < 7}V"(0)x 2 for x > 0 - see Figure 3.1. One reason why deviations from 
harmonicity are generally of this type is that it takes only a finite amount of 
energy to break a real object, so V (oo) should be finite, whereas the potential 
energy function of a harmonic oscillator increases without limit as x —> oo. 

Consider the anharmonic oscillator that has potential energy 

V(X) = - 4^2 - ( 3 - 27 ) 

cr + x z 

where Vo and a are constants. We cannot find the stationary states of this 
oscillator analytically any more than we can analytically solve its classical 
equations of motion. 3 But we can determine its quantum mechanics nu¬ 
merically, 4 and doing so will help to show which aspects of the results we 

3 Murphy’s law is in action here: the dynamics of the pendulum is analytically in¬ 
tractable precisely because it is richer and more interesting than that of the harmonic 
oscillator. 

4 A good way to do this is to turn the TISE into a finite matrix equation and then to 
use a numerical linear-algebra package to find the eigenvalues of the matrix. Figure 3.2 
was obtained using the approximation ~ ('i/Vi+i H-'i/’n—l — 2ip n )/A 2 , where denotes 
ip(nA) with A a small increment in x. With this approximation the TISE becomes the 
eigenvalue equation of a tridiagonal matrix that has 2b 2 / A 2 + V n /Vo on the leading diag¬ 
onal and —b 2 / A 2 above and below this diagonal, where b 2 = h 2 /2mVo and V n = V(nA). 
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Figure 3.2 The spectrum of the 
anharmonic oscillator for which the 
potential is plotted in Figure 3.1 
when the dimensionless variable 
2 ma 2 Vo/H 2 = 100. 



Figure 3.3 Values of a-j when there is significant uncertainty in E. 


have obtained for the harmonic oscillator are special, and which have general 
applicability. 

Figure 3.2 shows the anharmonic oscillator’s energy spectrum. At low 
energies, when the pendulum is nearly harmonic, the energies are nearly 
uniformly spaced in E. As we proceed to higher energies, the spacing between 
levels diminishes, with the consequence that infinitely many energy levels are 
packed into the finite energy range between — Vo and zero, where the particle 
becomes free. This crowding of the energy levels has the following implication 
for the time dependence of (x). Suppose there are just two energies with non¬ 
zero amplitudes, a n and un+i- Then (x) will be given by 

{x) = a* N a N+1 e i{EN - EN+l)t/h (N\x\N + 1) + complex conjugate. (3.28) 

This is a sinusoidal function of time, but its period, T = }i/{En+\ — En), 
depends on N. If we increase the energy and amplitude of the oscillator, we 
will increase N and Figure 3.2 shows that T will also increase. Classically 
the period of the oscillator increases with amplitude in just the same way. 
Thus there is an intimate connection between the spacing of the energy levels 
and classical dynamics. 

Consider now the case in which the energy is more uncertain, so that 
several of the aj are non-zero, and let these non-zero a,j be clustered around 
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j = N (see Figure 3.3). In this case several terms will occur in the sum for 
(x) 

<») = ••• + a* N _ 1 a N e i{EN - 1 ~ EN '> t/h {N - l\x\N) 

+ a* N+1 a N e i{EN + 1 - EN)t/h (N + l|a:|JV> (3.29) 

+ a* N+3 a N e i{EN+3 - EN)t/h (N + 3|;r|IV) + • • • 

where we have anticipated a result of §4.1.4 below that the matrix element 
(j\x\k) vanishes if j — k is even. The sum (3.29) differs from the correspond¬ 
ing one (3.26a) for a harmonic oscillator in the presence of matrix elements 
(j\x\k) with \j — k\ > 1 : in the case of the harmonic oscillator these ma¬ 
trix elements vanish, but in the general case they won’t. In consequence 
the series contains terms with frequencies (-Eat +3 — En)/Ti as well as terms 
in wjv = (-E/v+i — E]\r)/h. If these additional frequencies were all integer 
multiples of a single frequency u>n, the time dependence of (x) would be 
periodic with period Tjv = 2tt/ujn, but anharmonic, like that of the classical 
oscillator. Now (En +3 — En)/H ~ 3ww because the spacing between energy 
levels changes only slowly with N, so when, as in Figure 3.3, the non-zero 
amplitudes are very tightly clustered around N, the additional frequencies 
will be integer multiples of u>n to good accuracy, and the motion will indeed 
be periodic but anharmonic as classical mechanics predicts. 

If we release the oscillator from near some large extension X , the non- 
negligible amplitudes dj will be clustered around some integer N as depicted 
in Figure 3.3, and their phases will be such that at t = 0 the wavefunctions 
(x\j) will interfere constructively near X and sum to near zero elsewhere, 
ensuring that the mod-square of the wavefunction i/j(x,0) = JA aj(x\j) is 
sharply peaked around x = X. At a general time the wavefunction will be 
given by 

= e- iENt/R J2 ci(EN ~ Ej)t/n a j (x\j)- (3.30) 

3 

Since the spacing of the energy levels varies with index j, the frequencies in 
this sum will not be precisely equal to integer multiples of wjv = (.E/v+i — 
-Ejv)/7i, so after an approximate period Tn = 2it/lon most terms in the 
series will not have quite returned to their values at t = 0. Consequently, 
the constructive interference around x = X will be less sharply peaked than 
it was at t = 0, and the cancellation elsewhere will be correspondingly less 
complete. After each further approximate period T)v, the failure of terms in 
the series to return to their original values will be more marked, and the peak 
in \ijj(x, t)\ 2 will be wider. After a long time t 7jv the instants at which 
individual terms next return to their original values will be pretty uniformly 
distributed around an interval in t of length X)v, and \ip(x,t)\ 2 will cease to 
evolve very much: it will have become a smooth function throughout the 
range \x\ < X. 

This behaviour makes perfectly good sense classically. The uncertainty 
in E that enables the wavefunction to be highly localised at t = 0 corre¬ 
sponds in the classical picture to uncertainty in the initial displacement X. 
Since the period of an anharmonic oscillator is a function of the oscillator’s 
energy, uncertainty in X implies uncertainty in the oscillator’s period. After 
a long time even a small uncertainty in the period translates into a significant 
uncertainty in the oscillator’s phase. Hence after a long time the probability 
distribution for the particle’s position is fairly uniformly distributed within 
\x\ < X even in the classical case. 
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3.3 Motion in a magnetic field 

The formalism we developed for a harmonic oscillator enables us to solve an 
important, and you might have thought unconnected, problem: the motion 
of a particle of mass m and charge Q in a uniform magnetic field of flux 
density B. 

The first question to address when setting up the quantum-mechanical 
theory of a system is, “what’s the Hamiltonian?” because it is the Hamil¬ 
tonian that encodes mathematically how the system works, including what 
forces are acting. So we have to decide what the Hamiltonian should be for 
a particle of charge Q and mass to that moves in a magnetic field B(x). The 
answer proves to be 

H = 7 -—(p — QA) 2 , (3.31) 

2 TO 

where A is the vector potential that generates B through B = V x A. The 
most persuasive theoretical motivation of this Hamiltonian involves relativity 
and lies beyond the scope of this book. However, since we are exploring a new 
and deeper level of physical theory, we can ultimately only proceed by making 
conjectures and then confronting the resulting predictions with experimental 
measurements. In this spirit we adopt equation (3.31) as a conjecture from 
which we can try to recover the known behaviour of a charged particle in 
a magnetic field. In subsequent chapters we will show that this formula 
accounts satisfactorily for features in the spectra of atoms. Hence we can be 
pretty sure that it is correct. 

Since we know the equations of motion of a classical particle in a field B, 
let’s investigate the classical limit in the usual way, by finding the equations 
of motion of expectation values. With equation (2.34) we have that the rate 
of change of the expectation value of the i th component of x is 

in ^r = {[XiiH]) = i ( [Xh (p - QA)2] > • (3 - 32) 

The rules (2.22) and the canonical commutation relation [xi,pj\ = i hSij 
enable us to simplify the commutator 

2mifc d ^ = {[xi, (p - QA)] • (p - QA)) 

+ ((P — QA) • [xi, (p — QA)]) ^ 3 ' 33) 

= 2 i h (pi - QAi) , 


where we have used the fact that x commutes with A because A is a function 
of x only. Thus, with this Hamiltonian 

( p > = + Q ( A > • ( 3 - 34 ) 

that is, the momentum is tox plus an amount proportional to the vector 
potential. It is possible to show that the additional term represents the 
momentum of the magnetic field that arises because the charge Q is moving 
(Problem 3.19). 

In the classical limit we can neglect the difference between a variable and 
its expectation value because all uncertainties are small. Then with (3.34) 
the Hamiltonian (3.31) becomes just 4 tox 2 , which makes perfect sense since 
we know that the Lorentz force Qx x B does no work on a classical particle, 
and the particle’s energy is just its kinetic energy. 

To show that our proposed Hamiltonian generates the Lorentz force, we 
evaluate the rate of change of (p): 


ifi. 


d ( Pi) 


{\Pi,H ]) = (b») (p~ QA)] • (p - QA)) 

+ ((P - QA) • \ Pi , (p- QA)]) }. 


d t 


(3.35) 
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We now use equation (2.25) to evaluate the commutator [p,. A] and conclude 
that 


d (Pi) 
d t 


5s{(sr + 


(3.36) 


Notice that we cannot combine the two terms on the right of this equation 
because p does not commute with A and its derivatives. In the classical 
limit we can replace each operator by its expectation value, and then replace 
(p — Q A) by m(x). Similarly replacing the p on the left, we have in the 
classical limit 


2 Xj dA£ 

df 2 W df 


dA 

dx 


(3.37) 


where we have omitted the expectation value signs that ought to be around 
every operator. The time derivative on the left is along the trajectory of the 
particle (i.e., to be evaluated at ( x ) t ). If A has no explicit time dependence 
because the field B is static, its time derivative is just x- VA^. We move this 
term to the right side and have 


m 



Q 


■ <9 A . 

X • 7T-X • VA,;, 

OXi 


Q{ix (Vx A)}.. 


(3.38) 


Thus our proposed Hamiltonian (3.31) yields the Lorentz force in the classical 
limit. 


3.3.1 Gauge transformations 

Any magnetic field is Gauge invariant: A and A' = A + VA generate 
identical magnetic fields, where A(x) is any scalar function. A potential 
problem with the Hamiltonian (3.31) is that it changes in a non-trivial way 
when we change gauge, which is worrying because H should embody the 
physics, which is independent of gauge. We now show that this behaviour 
gives rise to no physical difficulty providing we change the phases of all kets 
at the same time that we change the gauge in which we write A. The idea 
that a change of gauge in a field such as A that mediates a force (in this 
case the electromagnetic force) requires a compensating change in the ket 
that is used to describe a given physical state, has enormously far-reaching 
ramifications in field theory. 

Suppose '(/’(x) = (x|lf) is an eigenfunction of the Hamiltonian for A: 

(p — QA) 2 \ip) = 2mE\ijj}. (3.39) 

Then we show that 

0(x) = e iQA/ V(x) (3.40) 

is an eigenfunction of the Hamiltonian we get by replacing A with A 7 . We 
start by noting that 

p QA' = p - Q( A + VA) = (p QVA) - QA 

and that for any wavefunction x(x) 

e iQA/R Px(x) = e iQA/n ( - iftVx(x)) 

= -iSV(e iQA/s x) - QVA(e iQA/R x) (3-41) 

= (p-QVA)(e i W»). 

We subtract QAe 1 ® A / h x from each side to obtain 

e iQA/ft (p - Q A)x(x) = (p - QA - QVA)(e i W s x(x)), 


(3.42) 
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and then apply this result to % = (p — QA)ip\ 

e iQA/h (p - QA)V(x) = (p - QA - QVA)e iQA ' h (p - QA)^(x) 

= (p - QA - QVA ) 2 (e iQA /V(x)), 

where the second equality uses (3.42) again, this time with y put 
ip. So if 

Hip = ^(p - QAfip = Eip, 

then 

H'(e iQA/n ip) = ^-(p - QA - QVA) 2 (e iQA/ ^(x)) = E(e iQA/n ip(x)), 

(3.45) 

In words, we can convert an eigenfunction of the Hamiltonian (3.31) with A 
to an eigenfunction of that Hamiltonian with A' = A + VA by multiplying 
it by e 1( ^ A / ft . Notice that A is an arbitrary function of x, so multiplication 
by e l Q A / h makes an entirely non-trivial change to ip(x). 

Given that there is a one-to-one relation between the eigenfunctions of 
H before and after we make a gauge transformation, it is clear that the 
spectrum of energy levels must be unchanged by the gauge transformation. 
What about expectation values? Since both kets and the Hamiltonian un¬ 
dergo gauge transformations, we should be open to the possibility that other 
operators do too. Let R' be the gauge transform of the operator R. Then 
the expectation value of R is gauge invariant if 

(R) = J d 3 xip*(x)Rip(x) = J d 3 xip*(x)e- iQA/h R'e iQA/n ip(x). (3.46) 

Clearly this condition is satisfied for R! = R if R is a function of x only. 
From our work above it is readily seen that if R depends on p, the equation 
is satisfied if p only occurs through the combination (p — QA), as in the 
Hamiltonian . 5 We believe that in any physical situation this condition on 
the occurrence of p will always be satisfied, so all expectation values are in 
fact gauge-invariant. 


(3.43) 
equal to 

(3.44) 


3.3.2 Landau Levels 

We now find the stationary states of a spinless particle that moves in a 
uniform magnetic field. Let the 2 -axis be parallel to B and choose the gauge 
in which A = \B{—y, x, 0). Then from equation (3.31) we have 


^ = 2m I ( Px + ^ By ) 2 + ( p v ~ \Q Bx ) 2 +P 2 z] 

= + 7T 2 ) + g-, 


(3.47a) 


where w = QB/m is the Larmor frequency and we have defined the di¬ 
mensionless operators 6 


p x + 2 mujy 

Tl’x = - - 

VmioU 


y/mwh 


(3.47b) 


H has broken into two parts. The term pi/2m is just the Hamiltonian of a 
free particle in one dimension in §2.3.3 we already studied motion governed 
by this Hamiltonian. The part 


H xy = + 7T 2 ) (3.47c) 

5 The principle that p and A only occur in the combination p — QA is known as the 

principle of minimal coupling. 

6 We are implicitly assuming that QB and therefore uj are positive. It is this assump¬ 
tion that leads to the angular momentum of a gyrating particle never being positive - see 
equation (3.58). 
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is essentially the Hamiltonian of a harmonic oscillator because it is the sum of 
squares of two Hermitian operators that satisfy the canonical commutation 
relation 

!>*,%] = y-dlhPy] - \Px,x}) = i. (3.48) 

The ladder operators are 


a = + i7 T y ) 


a T = -(n x - m y ) 


[a, a f ] = \\iTy, 7r x ] = 1, (3.49) 


and in terms of them H xy is 


H xy = huj(a'a + i). 


(3.50) 


It follows that the energy levels are E = hw(^, §,...)• These discrete energy 
levels for a charged particle in a uniform magnetic field are known as Landau 
levels. 

If particles can move freely parallel to B (which may not be possible in 
condensed-matter systems), the overall energy spectrum will be continuous 
notwithstanding the existence of discrete Landau levels. 

In the case of an electron the Larmor frequency is usually called the 
cyclotron frequency. It evaluates to 176(H/1 T) GHz, so the spacing of 
the energy levels is 1.16 x 10 _4 (I3/1 T) eV. At room temperature electrons 
have thermal energies of order 0.03 eV, so the discreteness of Landau levels 
is usually experimentally significant in the laboratory only if the system is 
cooled to low temperatures and immersed in a strong magnetic field. The 
strongest magnetic fields known occur near neutron stars, where B ~ 10 8 T 
is not uncommon, and in these systems electrons moving from one Landau 
level to the next emit or absorb hard X-ray photons. 

To find the wavefunction of a given Landau level, we write the ground 
state’s defining equation in the position representation 


o|0) =0 -O- 




|mu(r 



(x|0> = 0. 


(3.51) 


We transform to new coordinates u = x + iy, v = x — iy. 
yields 7 


d ii 

f d 

■ 9 ) 

d 1( 

( d 

, . d \ 

du 2 ' 

ydx 

l 9y) 

’ dv 21 

v dx 

dy) 


so a and can be written 

_ .r B ( d | u ^ _ t _ - r B ( d _ v\ 

\j2\dv 4 r 2 B ) ’ Q \J2\du 4 r 2 B ) 


The chain rule 
(3.52) 


(3.53a) 


where 



Equation (3.51) now becomes 


(3.53b) 


<9(x|0) 

dv 


+ -^2-m(x|0) = 0. 
4r 2 B 


(3.54) 


7 Aficianados of functions of a complex variable may ask what d/du can mean since the 
partial derivative involves holding constant v, which appears to be the complex conjugate 
of u. Use of u,v as independent coordinates requires permitting x,y to take on complex 
values. If you are nervous of using this mathematical fiction to solve differential equations, 
you should check that the wavefunction of equation (3.58) really is an eigenfunction of H. 
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Solving this first-order linear o.d.e. we find 

(x|0 )=g(u)e~ uv/4r B (3.55) 

where g(u) is an arbitrary function. On account of the arbitrariness of g(u), 
the ground state of motion in a magnetic field is not unique. This situation 
contrasts with the one we encountered when solving for the ground state of 
a harmonic oscillator. We obtain the simplest ground state by taking g to 
be a suitable normalising constant C - we’ll consider more elaborate choices 
below. Our present ground-state wavefunction is 

(x|0) = C , e“^ 3+!/2 ^ 4r ®. (3.56) 

In classical physics a particle that moves at speed v perpe ndicula r to a 
uniform magnetic field moves in circles of radius r = mv/QB = \/2 mE/QB = 
\J2E /mu 2 . When E = ^huj this radius agrees with the dispersion rg in 
radius of the Gaussian probability distribution | (ar|0)| 2 that we have just 
derived. 

A wavefunction in the first excited level is 

(x|l) oc (x|a^|0) oc ^ - — ^j- 2 -)e _ “' u / 4rB oc ve~ uv ^ 4rB . (3.57) 

B 

It is easy to see that each further application of (A will introduce an additional 
power of v. so that we have 

(x|n) oc v n e~ uv/4rB = (x - iy) n e- (x2+!/2)/4rB . (3.58) 

We shall see in §7.2.3 that the factor (x — iy) n implies that the particle has n 
units of angular momentum about the origin. 8 We can also show from this 
formula that for large n the expectation of the orbital radius increases as 
the square root of the energy, in agreement with classical mechanics (Prob¬ 
lem 3.20). 

Displacement of the gyrocentre A particle in the state (3.58) gyrates 
around the origin of the xy plane. Since the underlying physics (unlike the 
Hamiltonian 3.31) is invariant under displacements within the xy plane, there 
must be a ground-state ket in which the particle gyrates around any given 
point. Hence, every energy level associated with motion in a uniform mag¬ 
netic field is highly degenerate: it has more than one linearly independent 
eigenket. 

It was our choice of magnetic vector potential A that made the origin 
have a special status in H: the potential we used can be written A = |B x x. 
The choice A = — ^x • (B x a), where a is any vector, makes the gauge 
transformation from A to A' = A — |B x a, so if A = |B x x, then 
A' = ±B x (x — a). If we replace A in H with A', it will prove expedient 
to redefine n x ,n y such that the wavefunctions that are generated by the 
procedure we used before will describe a particle that gyrates around x = a 
instead of the origin. Thus in the gauge A', the wavefunction of a ground- 
state particle that gyrates around x = a is 

(x|0',a) = Ce -(x-a)2/4TB . (3.59) 

We can use the theory of gauge transformations that we derived in §3.3.1 to 
transform this back to our original gauge A. The result is 

(x|0, a) = C e i Q( Bxa ) x / 2h e -(x^a) 2 /4^ _ (3.60) 

8 This statement follows because in spherical polar coordinates (x — i y) n = r n e~ in< P. 
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This procedure is easily generalised to the determination of the wavefunction 
of the n tb Landau level for gyration about x = a. 

A complete set of mutually orthogonal stationary states is needed if we 
want to expand a general state of motion in a magnetic field as a linear 
combination of stationary states. Wavefunctions such as (3.56) and (3.60) 
that differ only in their gyrocentres are not orthogonal, so it is not convenient 
to combine them in a set of basis states. To obtain a complete set of mutually 
orthogonal states we can either return to equation (3.55) and set g(u) = 
u, u 2 ,..., etc., or we can step still further back to equations (3.47) and 
note that we started with four operators, x, y, p x and p y , but expressed the 
Hamiltonian H xy in terms of just two operators tt x and tt Vi which we then 
packaged into the ladder operators a and ah 

Consider the operators 


= 


Px - 2 mw y 
y/mujh 


£y — 


p y + \mwx 
y/muih 


(3.61) 


They differ from the operators tt x and n y defined by equations (3.47b) only 
in a sign each, and they commute with them. For example 

[£ x , 71 z ] = —^—r\Px ~ \muy,p x + \muy] = 0 

mcoR (3.62) 

[£x,7Ty] = - \ m uy,p y - \mux\ = 0. 

Consequently they commute with H xy . On the other hand, [£z,£ y ] = —i, so 
from these operators we can construct the ladder operators 

^ ^ => [M f ] =![£*,&] = 1- (3-63) 

b' = ^x + i&) 


Since these ladder operators commute with H xy , we can find a complete set 
of mutual eigenkets of tfb and H xy . 

In the position representation the new ladder operators are 


,r B /d V \ _ t ~ _-L!L( — _ 

y/2 V du 4 r 2 B ) ’ ^2 V dv 4r 2 B ) 


(3.64) 


When we apply b to the ground-state wavefunction (x|0) = Ce uv / 4r B 
(eq. 3.56), we find 


6(x|0) 


.Cr B ( d 
1 y/2 \du 


4 r%) 


e ~uv/4r% __ q 


(3.65) 


Thus Ce uv / 4r B ig annihilated by both a and b. When we apply to this 
wavefunction we obtain 


6 t (x|0) 


.Cr B t d 
yj2 \dv 


u 

4 r 2 

B 


) 


e ~uv/4r 2 B 


oc ue 


—uv/Ar 


2 

B 


(3.66) 


which is the wavefunction we would have obtained if we had set g(u ) = u in 
equation (3.55). In fact it’s clear from equation (3.66) that every application 
of 6' will introduce an additional factor u before the exponential. Therefore 
the series of ground-state wavefunctions that are obtained by repeatedly 
applying to e~ uv ! ArB are all of the form 


(i>t)”(x|0) oc u”e -w, / 4r ®. 


(3.67) 


The only difference between this general ground-state wavefunction 9 and the 
wavefunction of the n th excited Landau level (eq. 3.57) is that the former has 
u n rather than v n in front of the exponential. For the physical explanation 
of this result, see Problem 3.24. 


9 The absolute value of the real part of this is shown for n = 4 on the front cover. 
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3.3.3 Aharonov-Bohm effect 

Imagine a very long, thin solenoid that runs parallel to the z axis. There is 
a strong magnetic field inside the solenoid, but because the solenoid is long, 
the field lines are extremely thinly spread as they return from the solenoid’s 
north pole to its south pole, and outside the solenoid the magnetic field is 
negligibly small. In the limit that the solenoid becomes infinitely thin, a 
suitable vector potential for the field is 


A 


$ 

2tt 


(-JL — o) 

V r 2 ’ r 2 ’ ) ’ 


(3.68) 


where r = \J x 2 + y 2 and $ is the magnetic flux through the solenoid. To 
justify this statement we note that when we integrate A around a circle 
of radius r, the integral evaluates to $ independent of r. But by Stokes’ 
theorem 


dx ■ A = 


V x A = / d 2 x • B. 


(3.69) 


Thus $ units of flux run along the axis r = 0, and there is no flux anywhere 
else. 

Now we place a screen with two slits in the plane y = 0, with the slits 
distance 2s apart and running parallel to the solenoid and on either side of 
it. We bombard the screen from y < 0 with particles that have well defined 
momentum p = pj parallel to the y axis, and we detect the arrival of the 
particles on a screen P that lies in the plane y = L - apart from the presence 
of the solenoid, the arrangement is identical to that of the standard two- 
slit experiment of §2.3.4. Classical physics predicts that the particles are 
unaffected by B since they never enter the region of non-zero B. Aharonov 
& Bohm pointed out 10 that the prediction of quantum mechanics is different. 

Consider the function 

A = —(3.70) 
where 8 is the usual polar angle in the xy plane. Since 9 = arctan (y/x), 


d8_ 

dx 



and 


d8 

dy 


x 

~2 5 
r z 


and the gradient of A is 


VA = — 


$ 

2m' 2 


(~y,x, 0) 


(3.71) 


(3.72) 


which is minus the vector potential A of equation (3.68). So let’s make a 
gauge transformation from A to A' = A + VA. In this gauge, the vector 
potential vanishes, so the Hamiltonian is just that of a free particle, p 2 /2m. 
Hence the analysis of §2.3.4 applies, and the amplitude to pass through a 


10 Y. Aharonov & D. Bohm, Phys. Rev. 115, 485 (1959) 
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given slit and arrive at a point on the screen P with coordinate x has a phase 
(j) that is proportional to x (cf eq. 2.71): 


ct>i = constant ± ——, (3.73) 

lllj 

where the plus sign applies for one slit and the minus sign for the other. 

Our choice of gauge leads to a tricky detail, however. We require A to be 
single valued, so we must restrict the polar angle 9 to a range 27 t in extent. 
Consequently, 9 and A must be somewhere discontinuous. We get around 
this problem by using different forms of A, and therefore different gauges, to 
derive the amplitudes for arrival at x from each slit. For slit Si at x = +s, 
we take —tt < 9 < n, and for S 2 at x = — s we take —27r < 9 < 0. With these 
choices the discontinuity in A occurs where the electron does not go, and A is 
always the same in the region y < 0 occupied by the incoming electron beam. 
Consequently, the amplitudes for arrival at a point x on the screen P are the 
same as if the solenoid were not there. However, before we can add the 
amplitudes and calculate the interference pattern, we have to transform to a 
common gauge. The easiest way to do this is to transform the amplitude for 
S 2 to the gauge of Si. The function that effects the transformation between 
the gauges is A = Ai — A 2 , where A, is the gauge function used for slit S,;. At 
any point of P the two forms of 9 differ by 27r, so A = —<F. Therefore equation 
(3.40) requires us to multiplying the amplitude for S 2 by exp(— iQQ/h), and 
the quantum interference term (1.15) becomes 


constant x e 1 ^ 1 ^ 2+( 3 ®/ h ) ^ ex p 



(3.74) 


The term Q4> in the exponential shifts the centre of the interference pattern 
by an amount Ax = —LQQ/2ps, so by switching the current in the solenoid 
on and off you can change the interference pattern that is generated by par¬ 
ticles that never enter the region to which B is confined. This prediction was 
first confirmed experimentally by R.G. Chambers. 11 Although this effect has 
no counterpart in classical mechanics, curiously the shift Ax is independent 
of h and does not vanish in the limit h —> 0, which is often regarded as the 
classical limit. 


Problems 


3.1 After choosing units in which everything, including K = 1, the Hamilto¬ 
nian of a harmonic oscillator may be written H = ^(p 2 +x 2 ), where [ x,p\ = i. 
Show that if \ip) is a ket that satisfies H\ip) = E\tp), then 

\{p 2 + x 2 ){x =F ipM) = (E± l)(x =F ip)\ip)- (3.75) 


Explain how this algebra enables one to determine the energy eigenvalues of 
a harmonic oscillator. 

3.2 Given that A\E„) = a\E„-i) and E n = (n+ \)Tiw, where the annihi¬ 
lation operator of the harmonic oscillator is 


A = 


mu>x + ip 
a/ 2 mTiuj 


(3.76) 


show that a = sfn. Hint: consider |A|E n )| 2 . 

3.3 The pendulum of a grandfather clock has a period of Is and makes 
excursions of 3 cm either side of dead centre. Given that the bob weighs 
0.2 kg, around what value of n would you expect its non-negligible quantum 
amplitudes to cluster? 


11 Phys. Rev. Lett. 5, 3 (1960) 
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Figure 3.5 The wavefunctions (x\2) 
and (a) 40) of two stationary states of 
a harmonic oscillator. 


3.4 Show that the minimum value of E(p,x) = p 2 /2m + i muj 2 x 2 with 

respect to the real numbers p, x when they are constrained to satisfy xp = \Ti, 
is Explain the physical significance of this result. 

3.5 How many nodes are there in the wavefunction ( x\n) of the n th excited 
state of a harmonic oscillator? 


3.6 Show that in terms of a harmonic oscillator’s characteristic length £ = 
y/h/2muj the ladder operators can be written 




(3.77) 


x „ d , .j. x , d 

2£ dx 2£ dx 

Hence show that the wavefunction of the second excited state is (x|2) = 
constant x (x 2 /£ 2 — \)e~ x Z 4 ^ and find the normalising constant. 

3.7 Explain why the wavefunction (x\n) of the oscillator’s n th stationary 
state must have the form 


(x\n) = H n (x/£)e ^Z 4 ^, 


(3.78) 


where H n is an n th -order (‘Hermite’) polynomial. By casting the equations 
A\n) = y/n\n — 1) and A* \n— 1) = yjn\n) in the x-representation, show that 

H' n (x/£) = y/nH„-i(x/£) and y/nH n (x/£) = ^H n ^ 1 (x/£) - H' n _ x (x/£). 

(3.79) 

and thus that 

\JnH n (x/i) = - ( H n _i(x/£) - y/n- 1 H n - 2 (x/£). (3.80) 


Given that Ho = (2n£ 2 ) 4 Z 4 and H\(y) = y/(2'K£ 2 ) 1 / i , use this recurrence 
relation to reproduce the plots of the wavefunctions (x\2) and (x|40) shown 
in Figure 3.5. Explain the physical significance of the vertical arrows. Why 
is the amplitude of (x|40) largest near the right arrow? 


3.8 Use 

x = J-^—(A + A^) = £(A + A^) (3.81) 

V 2mco 

to show for a harmonic oscillator that in the energy representation the op¬ 
erator x is 

/ 0 v/1 0 0 ... \ 

y/1 0 V2 0 

0 y/2 0 V3 

V3 ... 


Xj k — £ 




0 y/n^T 
yfn ^T 0 

y/n 


y/n 

0 y/n +1 
y/n + 1 0 


Calculate the same entries for the matrix pjk- 


-/ 

(3.82) 
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3.9 Show that the momentum operator of a harmonic oscillator can be 
expressed in terms of the creation and annihilation operators as 

P ='£(■*'-A) Where (3.83) 

Hence show that 

(0b 2 |0) = (J^j . (3.84) 

How does this result relate to the physics of a free particle discussed in §2.3.3? 

3.10 At t = 0 the state of a harmonic oscillator, mass m frequency oj. is 

l^^liV-D + ^liV}. (3.85) 

Show that subsequently 


(x) t = VN£cos(ujt) where l 



(3.86) 


Interpret this result physically. What does this example teach us about the 
validity of classical mechanics? 

Show that a classical oscillator with energy (N + ^)huj has amplitude 


•^max 


= 2 v^+H 


(3.87) 


To explain the discrepancy between these results, consider the case in 
which initially 


1 JV+A'-l 

W = £ '*> 

v k=N 


(3.88) 


with N K 1. Show that then (x) t ~ 2y/N£cos(u>t) consistent with 
classical physics. 

3.11* By expressing the annihilation operator A of the harmonic oscillator 
in the momentum representation, obtain (p|0). Check that your expression 
agrees with that obtained from the Fourier transform of 


<z|0> 


1 0 ~x 2 /U 2 

( 2 tt £ 2 ) 1 / 4 


where 



(3.89) 


3.12 Show that for any two N x N matrices A, B , trace)[A, H]) = 0. Com¬ 
ment on this result in the light of the results of Problem 3.8 and the canonical 
commutation relation [x,p] = i Ti. 

3.13* A Fermi oscillator has Hamiltonian H = f 'f■ where / is an oper¬ 
ator that satisfies 


f = 0 ; // t + / t / = I- (3.90) 

Show that H 2 = H : and thus find the eigenvalues of H. If the ket |0) 
satisfies H |0) = 0 with (0|0) = 1, what are the kets (a) |a) = /|0), and (b) 
|6) = J+IO)? 

In quantum field theory the vacuum is pictured as an assembly of os¬ 
cillators, one for each possible value of the momentum of each particle type. 
A boson is an excitation of a harmonic oscillator, while a fermion in an ex¬ 
citation of a Fermi oscillator. Explain the connection between the spectrum 
of f'f and the Pauli principle. 
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3.14 In the time interval (t + 6t,t) the Hamiltonian H of some system 
varies in such a way that |i?|^>)| remains finite. Show that under these 
circumstances \ip) is a continuous function of time. 

A harmonic oscillator with frequency u> is in its ground state when the 
stiffness of the spring is instantaneously reduced by a factor f 4 < 1, so its 
natural frequency becomes f 2 uj. What is the probability that the oscillator 
is subsequently found to have energy | hf 2 co? Discuss the classical analogue 
of this problem. 

3.15* P is the probability that at the end of the experiment described in 
Problem 3.14, the oscillator is in its second excited state. Show that when 
/ = P = 0.144 as follows. First show that the annihilation operator of 
the original oscillator 

41 = ± {(r 1 + w + (r 1 - /)A't}, ( 3 . 9 i) 

where A' and A'^ are the annihilation and creation operators of the final 
oscillator. Then writing the ground-state ket of the original oscillator as a 
sum |0) = n c n \n') over the energy eigenkets of the final oscillator, impose 
the condition A|0) = 0. Finally use the normalisation of |0) and the orthogo¬ 
nality of the In'). What value do you get for the probability of the oscillator 
remaining in the ground state? 

Show that at the end of the experiment the expectation value of the 
energy is 0.2656 Tiui. Explain physically why this is less than the original 
ground-state energy i Tito. 

This example contains the physics behind the inflationary origin of the 
Universe: gravity explosively enlarges the vacuum, which is an infinite collec¬ 
tion of harmonic oscillators (Problem 3.13). Excitations of these oscillators 
correspond to elementary particles. Before inflation the vacuum is unexcited 
so every oscillator is in its ground state. At the end of inflation, there is non- 
negligible probability of many oscillators being excited and each excitation 
implies the existence of a newly created particle. 

3.16* In terms of the usual ladder operators A, A', a Hamiltonian can be 
written 

H = nA^A + A(A + A f ). (3.92) 

What restrictions on the values of the numbers /r and A follow from the 
requirement for H to be Hermitian? 

Show that for a suitably chosen operator B, H can be rewritten 

H = B + constant. (3.93) 

where [B, B^] = 1. Hence determine the spectrum of H. 

3.17* Numerically calculate the spectrum of the anharmonic oscillator shown 
in Figure 3.2. From it estimate the period at a sequence of energies. Compare 
your quantum results with the equivalent classical results. 

3.18* Let B = cA+sAl , where c = cosh (9, s = sinh 6 with 9 a real constant 
and A , A 1 are the usual ladder operators. Show that [B, B^] = 1. 

Consider the Hamiltonian 

H = eA*A + ±\(A*A*+AA), (3.94) 

where e and A are real and such that e > A > 0. Show that when 

ec — As = Ec ; Ac — es = Es (3.95) 

with E a constant, [B,H\ = EB. Hence determine the spectrum of H in 
terms of e and A. 
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3.19 This problem is all classical electromagnetism, but it gives physical 
insight into quantum physics. It is hard to do without a command of Carte¬ 
sian tensor notation (Appendix B). A point charge Q is placed at the origin 
in the magnetic field generated by a spatially confined current distribution. 
Given that 


E = Q - 

47T6q r 3 


(3.96) 


and B = V x A with V • A = 0, show that the field’s momentum 


P = eo 


d 3 xE x B = QA(0). 


(3.97) 


Write down the relation between the particle’s position and momentum and 
interpret this relation physically in light of the result you have just obtained. 

Hint: write E = — (Q/47reo)Vr _1 and B = V x A, expand the vector 
triple product and integrate each of the resulting terms by parts so as to 
exploit in one V ■ A = 0 and in the other V 2 r -1 = — 47r<5 3 (r). The tensor 
form of Gauss’s theorem states that f d 3 xV;T = j> cl 2 Si T no matter how 
many indices the tensor T may carry. 


3.20 From equation (3.58) show that the the normalised wavefunction of 
a particle of mass m that is in the rc th Landau level of a uniform magnetic 
field B is 


(x|n) = 


re 


-r 2 /4r^ e ~in 


2 (n+l)/ 2 A 


n+1 


(3.98) 


where rs = \Jh/QB. Hence show that the expectation of the particle’s 
gyration radius is 


{r) n = (n\r\n) = y/2 


(n+ i)! 


-tb- 


(3.99) 


Show further that 

^ ln (r) ra _ J_ 
Sn 2 n 


(3.100) 


and thus show that in the limit of large n, (r) oc y/E, where E is the energy 
of the level. Show that this result is in accordance with the correspondence 
principle. 


3.21 Show that in the gauge in which the magnetic vector potential is 
A = ±B x x the wavefunction of the n th Landau level of gyration about the 
point a is 


(x|n,a) =e i<3(Bxa) ' x/2?i {(^-a ;c )-i( 2 /-a y )} T! e- |x - a|2/4r B. (3.101) 


3.22 A particle of charge Q is confined to move in the xy plane, with 
electrostatic potential <f> = 0 and vector potential A satisfying 


V x A = (0,0, B). 


(3.102) 


Consider the operators p x , p v , R x and R y , defined by 


P = 


QB 


x (p - QA) 


and R = r — p, 


(3.103) 


where r and p are the usual position and momentum operators, and e~ is 
the unit vector along B. Show that the only non-zero commutators formed 
from the x- and y-conrponents of these are 


[Px,P v \ = i r% and [R x ,R y \ = -ir|, 


(3.104) 
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where r 2 B = Ti/QB. 

The operators a, a\ b and are defined via 

a =—rl — (px + ipy) and b = — (R y + iR x ). (3.105) 

V2 r B V2r s 

Evaluate [a, af] and [ 6 , b^]. Show that for suitably defined oj, the Hamiltonian 
can be written 

H = Tiu> (a}a + 5 ) . (3.106) 

Given that there exists a unique state | ip) satisfying 

a|V>) = b\ip) = 0, (3.107) 


what conclusions can be drawn about the allowed energies of the Hamiltonian 
and their degeneracies? What is the physical interpretation of these results? 


3.23 Using cylindrical polar coordinates (R, </>, z), show that the probability 
current density associated with the wavefunction (3.98) of the n th Landau 
level is 


T/ „, rii? 2n - 1 e- jR2 / 2r B / B 2 \ 

( } “ _ 2 n+1 Trn\mr 2 B +2 V + 2^J ^ 


(3.108) 


where rs = sJh/QB. Plot J as a function of R and interpret your plot 
physically. 


3.24 Determine the probability current density associated with the n th 
Landau ground-state wavefunction (3.67) (which for n = 4 is shown in Fig¬ 
ure Landaufig). Use your result to explain in as much detail as you can why 
this state can be interpreted as a superposition of states in which the electron 
gyrates around different gyrocentres. Hint: adapt equation (3.108). 

Why is the energy of a gyrating electron incremented if we multiply the 
wavefunction e ~( m ^/ 4n ) r by v n = (x — i y) n but not if we multiply it by 
u n = (x + i y) n l 



4 

Transformations & Observables 


In §2.1 we associated an operator with every observable quantity through 
a sum over all states in which the system has a well-defined value of the 
observable (eq. 2.5). We found that this operator enabled us to calculate 
the expectation value of any function of the observable. Moreover, from the 
operator we could recover the observable’s allowed values and the associ¬ 
ated states because they are the operator’s eigenvalues and eigenkets. These 
properties make an observable’s operator a useful repository of information 
about the observable, a handy filing system. But they do not give the opera¬ 
tor much physical meaning. Above all, they don’t answer the question ‘what 
does an operator actually do when it operates?’ In this chapter we answer 
this question. In the process of doing this, we will see why the canonical 
commutation relations (2.54) have the form that they do, and introduce the 
angular-momentum operators, which will play important roles in the rest of 
the book. 


4.1 Transforming kets 

When one meets an unfamiliar object, one may study it by moving it around, 
perhaps turning it over in one’s hands so as to learn about its shape. In §1.3 
we claimed that all physical information about any system is encapsulated 
in its ket \ip), so we must learn how | ip) changes as we move and turn the 
system. 

Even the simplest systems can have orientations in addition to posi¬ 
tions. For example, an electron, a lithium nucleus or a water molecule all 
have orientations because they are not spherically symmetric: an electron 
is a magnetic dipole, a 7 Li nucleus has an electric quadrupole, and a water 
molecule is a V-shaped thing. The ket \tp) that describes any of these objects 
contains information about the object’s orientation in addition to its position 
and momentum. In the next subsection we shall focus on the location of a 
quantum system, but later we shall be concerned with its orientation as well, 
and in preparation for that work we explicitly display a label p of the sys¬ 
tem’s orientation and any other relevant properties, such as internal energy. 
For the moment /z is just an abstract symbol for orientation information; the 
details will be fleshed out in §7.1. 
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Figure 4.1 A spherical wavefunction 
and its displaced version. 


4.1.1 Translating kets 

We now focus on the location of our system. To keep track of this we use a 
coordinate system Eo whose origin is some well-defined point, say the centre 
of our laboratory. We can investigate \ip) by expanding it in terms of a 
complete set of eigenstates |x, ff), where x is the position vector of the centre 
of mass and g represents the system’s orientation. The amplitude for finding 
the system’s centre of mass at x with the orientation specified by p is 

^(x) = <x,/i|V>). (4.1) 

If we know all the derivatives of the wavefunction ^ at a position x, Taylor’s 
theorem gives the value of the wavefunction at some other location x — a as 

., , r d i ( o \ 2 

- a) = ^4 _ a - _ + _ (^ a . _ j -... 

= exp (~a- ^ Vv( x ) 

= (x, fj\ exp (—i—d-) | i>). 

This equation tells us that in the state \ijf), the amplitude to find the system 
at x — a with orientation etc /r is the same as the amplitude to find the 
system with unchanged orientation at x when it is a different state, namely 

\tp') = U (a)\ip) where 17(a) = exp(—ia • p/h). (4.3) 

In this notation, equation (4.2) becomes 



ipn ( x — a) = (x, /j,\U(a.)\^>) = (x,/z| ip') = V^( x ), (4-4) 


so, as Figure 4.1 illustrates, the wavefunction ip 1 for \ijj') is the wavefunction 
we would expect for a system that is identical with the one described by 
\ip) except for being shifted along the vector a. We shall refer to this new 
system as the translated or transformed system and we shall say that the 
translation operator operator 17(a) translates \if>) through a even though 
| ip) is not an object in real space, so this is a slight abuse of language. 

The ket \ip') of the translated system is a function of the vector a. It 
is instructive to take its partial derivative with respect to one component of 
a, say a x . Evaluating the resulting derivative at a = 0, when \i/j') = |^), we 


find 


~aZ = ~d^ = rM) ' 


(4.5) 


Thus the operator p x gives the rate at which the system’s ket changes as 
we translate the system along the x axis. So we have answered the question 
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Box 4.1: Passive transformations 

We can describe objects such as atoms equally well using any coordinate 
system. Imagine a whole family of coordinate systems set up throughout 
space, such that every physical point is at the origin of one coordinate 
system. We label by E y the coordinate system whose origin coincides 
with the point labelled by y in our original coordinate system So, and we 
indicate the coordinate system used to obtain a wavefunction by making 
y a second argument of the wavefunction; i/v( x: y) is the amplitude to 
find the system at the point labelled x in E y . Because the different 
coordinate systems vary smoothly with y, we can use Taylor’s theorem 
to express amplitudes in, say, E a+y in terms of amplitudes in E y . We 
have 

Vv(x;a + y) = exp • -^-^ M (x;y). (1) 

Now V’m( x ; a ) = ?/v(x+ a ; 0) because both expressions give the amplitude 
for the system to be at the same physical location, the point called x in 
E a and called x + a in Eo- Then equations (4.4) and (1) give 

( x , M;0|C/(—a)|?/’) = W(x + a;0) = i/v( x ; a )> ( 2 ) 

where again |x, fi; 0) indicates the state in which the system is located 
at the point labelled by x in E 0 . This equation tells us that \i/j) has 
the same wavefunction in E a that |^) = U(—a)\if>) has in Eo- There¬ 
fore, moving the origin of our coordinates through a vector a has the 
same effect on an arbitrary state’s wavefunction as moving the system 
itself through —a. Physically moving the system is known as an active 
transformation, whereas leaving the state alone but changing the co¬ 
ordinate system is called a passive transformation. The infinitesimal 
vectors required to make logically equivalent active and passive transfor¬ 
mations differ in sign. This sign difference reflects the fact that if you 
move backwards, the world around you seems to move forwards; hence 
moving the origin of one’s coordinates back by 6a has the same effect as 
moving the system forward by da. In this book we confine ourselves to 
active transformations. 


posed above as to what an observable’s operator actually does in the case of 
the momentum operators. 

Equation (2.78) enables us to expand a state of well-defined position x 0 
in terms of momentum eigenstates. We have 

I x o,m) = y"d 3 p |p,^)(p,Ai|xo,At> = j d 3 pe -1Xo ' P//ri |p,/i). (4.6) 

Applying the translation operator, we obtain with (4.3) 

U(a) |x 0 ,/x) = J cl :i pe" IX0 ' p/a t/(a)|p,/x) 

= ^/ d3 P e ” i(xo+a)p/?i |P^) (4 ‘ 7) 

= |x 0 +a,/z), 

which is a new state, in which the system is definitely located at x 0 + a, as 
we would expect. 


4.1.2 Continuous transformations and generators 

In §1.3 we saw that the normalisation condition (tp\ip) = 1 expresses the 
fact that when we make a measurement of any observable, we will measure 
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Box 4.2: Operators from expectation values 

In this box we show that if 

(^\A\ip) = (it>\B\vl>), (1) 

for every state | t/>), then the operators A and B are identical. We set 
\ip) = |<^} + A|x), where A is a complex number. Then equation (1) implies 

A mA\x) (<P\B\x)) = A* ((x\B\<t>) - (x\m) • (2) 

Since equation (1) is valid for any state | ip), equation (2) remains valid as 
we vary A. If the coefficients of A and A* are non-zero, we can cause the 
left and right sides of (2) to change differently by varying the phase of 
A; they can be equal irrespective of the phase of A only if the coefficients 
vanish. This shows that (x|4|</>) = (x\B\<j)) for for arbitrary states |</>} 
and |x), from which it follows that A = B. 


some value; for example, if we determine the system’s location, we will find 
it somewhere. The normalisation condition must be unaffected by any trans¬ 
formation that we make on a system, so the transformation operator 1 U 
must have the property that for any state | ip) 

1 = <V>V) = (ip\ U'UW). (4.8) 

From this requirement we can infer by the argument given in Box 4.2 (with 
A = WU and B = /, the identity operator) that WU = /, so W = U~ x . 
Operators with this property are called unitary operators. When we trans¬ 
form all states with a unitary operator, we leave unchanged all amplitudes: 
i/j') = {(/)\ip) for any states | $) and \i/j). 

Exactly how we construct a unitary operator depends on the type of 
transformation we wish it to make. The identity operator is the unitary 
operator that represents doing nothing to our system. The translation oper¬ 
ator U(a) can be made to approach the identity as closely as we please by 
diminishing the magnitude of a. Many other unitary operators also have a 
parameter 9 that can be reduced to zero such that the operator tends to the 
identity. In this case we can write for small 59 

U(59) = I-i69r + 0(59) 2 , (4.9) 

where the factor of i is a matter of convention and r is an operator. The 
unitarity of U implies that 

I = U ] ( 59)U{59) = I + i59 (r f - r) + 0(<56» 2 ). (4.10) 

Equating powers of 69 on the two sides of the equation, we deduce that r 
is Hermitian, so it may be an observable. If so, its eigenkets are states in 
which the system has well-defined values of the observable r. 

We obtain an important equation by using equation (4.9) to evaluate 
1^//} s U(59)\ip). Subtracting \tp) from both sides of the resulting equation, 
dividing through by 59 and proceeding to the limit 59 —> 0, we obtain 

= ( 4 - 11 ) 

Thus the observable r gives the rate at which \ip) changes when we increase 
the parameter 9 in the unitary transformation that r generates. Equation 
(4.5) is a concrete example of this equation in action. 

1 We restrict ourselves to the case in which the operator U is linear, as is every operator 
used in this book. In consequence, we are unable to consider time reversal. 
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A finite transformation can be generated by repeatedly performing an 
infinitesimal one. Specifically, if we transform N times with U(59) with 
69 = 0/N, then in the limit N —> oo we have 

/ 0 \ N 

C/^.Jim (f-i-rj =e-- (4.12) 

This relation is clearly a generalisation of the definition (4.3) of the trans¬ 
lation operator. The Hermitian operator r is called the generator of both 
the unitary operator U and the transformations that U accomplishes; for 
example, p/7i is the generator of translations. 


4.1.3 The rotation operator 

Consider what happens if we rotate the system. Whereas in §4.1.1 we con¬ 
structed a state \rp') = U(a)\i/j) that differed from the state \ip) only in a 
shift by a in the location of the centre of mass, we now wish to find a ro¬ 
tation operator that constructs the state | ip') that we would get if we could 
somehow rotate the apparatus on a turntable without disturbing its internal 
structure in any way. Whereas the orientation of the system is unaffected 
by a translation, it will be changed by the rotation operator, as is physically 
evident if we imagine turning a non-splierical object on a turntable. 

From §4.1.2 we know that a rotation operator will be unitary, and have a 
Hermitian generator. Actually, we expect there to be several generators, just 
as there are three generators, p x /h, Py/h and Pz/h , of translations. Because 
there are three generators of translations, three numbers, the components of 
the vector a in equation (4.3), are required to specify a particular translation. 
Hence we anticipate that the number of generators of rotations will equal 
the number of angles that are required to specify a rotation. Two angles are 
required to specify the axis of rotation, and a third is required to specify 
the angle through which we rotate. Thus by analogy with equation (4.3), we 
expect that a general rotation operator can be obtained by exponentiating 
a linear combination of three generators of rotations, and we write 

U{ol) = exp(—ia • J). (4-13) 

Here a is a vector that specifies a rotation through an angle |ct| around 
the direction of the unit vector a, and J is comprised of three Hermitian 
operators, J x , J y and J z . In the course of this chapter and the next it will 
become clear that the observable associated with J is angular momentum. 
Consequently, the components of J are called the angular-momentum op¬ 
erators. 

The role that the angular momentum operators play in rotating the 
system around the axis a is expressed by rewriting equation (4.11) with 
appropriate substitutions as 


.m 

da 


= a 


m- 


(4.14) 


4.1.4 Discrete transformations 

(a) The parity operator Not all transformations are continuous. In 
physics, the most prominent example of a discrete transformation is the 
parity transformation V, which swaps the sign of the coordinates of all 
spatial points; the action of V on coordinates is represented by the matrix 

-10 0 \ 

0 -1 0 j so Px = -x. (4.15) 

0 0 - 1 / 
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Notice that det V = —1, whereas a rotation matrix has detR = +1. In fact, 
any linear transformation with determinant equal to —1 can be written as a 
product of V and a rotation R. 

Let an arbitrary quantum state | ip) have wavefunction Vv( x ) = ( x , M|0}> 
where the label y is the usual shorthand for the system’s orientation. Then 
the quantum parity operator P is defined by 

= (x, mI-PIV’} = VvC^x) = Vv(~ x ) = (“ x > HV’)- (4-16) 

The wavefunction of the new state, \ip') = P\ip), takes the same value at x 
that the old wavefunction does at —x. Thus, when the system is in the state 
P\ip)i it has the same amplitude to be at x as it had to be at — x when it was 
in the state \ip). The orientation and internal properties of the system are 
unaffected by P. The invariance of orientation under a parity transformation 
is not self evident, but in §4.2 we shall see that it follows from the rules that 
govern commutation of P with x and J. 

Applying the parity operator twice creates a state | ip") = P\ip') = P 2 \ip) 
with wavefunction 

V^'( x ) = (x, iA p W) = (~ x > = (-x, n\ p H) = < x > mI0) (41?) 

= ^( x )- 

Hence P 2 = 1 and an even number of applications of the parity operator 
leaves the wavefunction unchanged. It also follows that P = P _1 is its own 
inverse. 

P is also Hermitian: 


(01PW* = f d 3 x^]((0|x, y\P\ip))* 

= /d 3x ^«0|x,/i)(-x,/r|V>))* 

= /"d 3 x^(0|-x,^)(x,^|P 2 |</>) 

= J d 3 x^(0| - x, /r)(-x, /x|P|0) = WP\(f>), 


(4.18) 


so P^ = P. It now follows that P is unitary 2 because P -1 = P = P0 Hence 
from the discussion of §4.1.2 it follows that transforming all states with P 
will preserve all amplitudes for the system. 

Suppose now that |P) is an eigenket of P, with eigenvalue A. Then 
|P) = P 2 |P) = AP|P) = A 2 |P), so A 2 = 1. Thus the eigenvalues of P are 
±1. Eigenstates of P are said to have definite parity, with |+) = P|+) 
being a state of even parity and |—) = —P|—) being one of odd parity. 

In §3.1 we found that the stationary-state wavefunctions of a harmonic 
oscillator are even functions of x when the quantum number n is even, and 
odd functions of x otherwise. It is clear that these stationary states are also 
eigenstates of P, those for n = 0, 2,4,... having even parity and those for 
n = 1 ,3,5,... having odd parity. 

Mirror operators Systems not infrequently exhibit a mirror symmetry of 
some sort. When they do, it can be helpful to define an operator which trans¬ 
forms any state into the corresponding mirror state. Here’s an illustrative 
concrete example. 

A particle moves in two dimensions, so the amplitudes (x,y\ip) for the 
particle to be found at the location (x, y ) constitute a complete set of am¬ 
plitudes. Let the operator M be such that for any state \tp) 

(x,y\M\ip) = (y,x\ip). (4.19) 


2 Problem 4.9 shows that P is unitary by showing that it has an infinitesimal generator. 
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Figure 4.2 If the coordinates of the 
point marked with an filled circle are 
(x,y), then the coordinates of the 
point marked with an open circle are 
(y,x). The points would be object 
and image if a mirror lay along the 
line y = x. 


That is, in the state M\ip) the amplitude to be at the point (x,y) (marked 
with a filled dot in Figure 4.2) is the same as the amplitude to be at the 
point ( y,x ) (marked by an open dot) when in the state \ip). If there were a 
mirror along the line y = x, the image of a light at (x, y) would be located 
at (y,x). Thus the operator M produces the state we get by mirroring al 
the amplitudes in the line y = x. We leave as an exercise (Problem 4.12) the 
proof that M is a unitary operator, which closely follows the proof we gave 
of the unitarity of P. 

If we mirror a set of amplitudes twice, we obviously recover the original 
amplitudes, so M 2 = 1 and it follows that the eigenvalues of M can only be 
± 1 . 


4.2 Transformations of operators 

When we move an object around, we expect to find it in a new place. Specif¬ 
ically, suppose (^>|x|^>) = x 0 for some state \tp). Since x 0 just labels a spatial 
point, it must behave under translations and rotations like any vector. For 
example, translating a system that is in the state \ip) through a, we obtain 
a new state \ip') which has (^>'|x| ip') = xo + a = (^|x + / a\ip). On the other 
hand, from §4.1.1 we know that {tp'\x\'ip') = (a)xU(a)\t/j). Since these 

expectation values must be equal for any initial state it follows from the 
argument given in Box 4.2 that 

(7^(a)x(7(a) =x + a, (4.20) 


where the identity operator is understood to multiply the constant a. For 
an infinitesimal translation with a — > 5a. we have 17(a) ~ 1 — ia • p/h. So 


x + 6a ~ 




= x — 


j- [x, da ■ p] + O(da) 2 . 


(4.21) 


For this to be true for all small vectors <5a, x and p must satisfy the commu¬ 
tation relations 

[xi,pj] = ihSij (4.22) 

in accordance with equation (2.54). Here we see that this commutation rela¬ 
tion arises as a natural consequence of the properties of x under translations. 
For a finite translation, we can write 


17^ (a) x 17(a) = U^(a)U(a)x + 17^ (a) [x, 17(a)] = x +17^ (a) [x, 17(a)] . (4.23) 


We use equation (2.25) to evaluate the commutator on the right. Treating 
U as the function e~ la ' p ^ h of a • p, we find 


17* (a) x 17(a) 


= x — ^-f7*(a)[x, a • p] 17(a) = x + a 


(4.24) 
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Box 4.3: Rotations in ordinary space 

A rotation matrix R is defined by the conditions R T = R 1 and 
det(R) = +1. If R(a) rotates around the a axis, it should leave this 
axis invariant so R(a)a = a. For a rotation through an angle |a|, 
TrR(a) = 1 + 2 cos |a|. 

Let a be an infinitesi¬ 
mal rotation vector: that is, 
a rotation through a around 
the axis that is in the direc¬ 
tion of the unit vector a. 

We consider the effect of ro¬ 
tating an arbitrary vector v 
through angle a. The com¬ 
ponent of v parallel to a is 
unchanged by the rotation. 
The figure shows the projection of v into the plane perpendicular to 
a. The rotated vector is seen to be the vectorial sum of v and the in¬ 
finitesimal vector a x v that joins the end of v before and after rota¬ 
tion. That is 

V = v + a x v. 



as equation (4.20) requires. 

Similarly, under rotations ordinary spatial vectors have components 
which transform as v —> R(a)v, where R(a) is a matrix describing a rotation 
through angle |a| around the a axis. The expectation values (V ; |x|'0) = Xo 
should then transform in this way. In §4.1.2 we saw that when a system is 
rotated through an angle |a| around the a axis, its ket | ip) should be mul¬ 
tiplied by U(a) = e -1 “' J . If this transformation of | ip) is to be consistent 
with the rotation of the expectation value of x, we need 

R(a)(V'|x|'0) = (y> / |x|y> / ) = (i/j\U jl (a)xU{a)\ip). (4-25) 

Since this must hold for any state | ip), from the argument given in Box 4.2 
it follows that 

R(a)x = t/^(a) x U{cl). (4-26) 

For an infinitesimal rotation, a —> 8a and R(a)x ~ x + 8a x x as is 
shown in Box 4.3, so equation (4.26) becomes 

x + 8a x x ~ (1 + i 8a • J) x (1 — i5a • J) 

= x + i [5a • J, x] + 0(5a) 2 . 

In components, the vector product 8a x x can be written 
(5a x x)^ ^ [ eij\ i 8oij X} 

jk 

where Cijk is the object that changes sign if any two subscripts are inter¬ 
changed and has e xyz = 1 (Appendix B). For example, equation (4.28) gives 
(5a x x)^ — J2 jk e xjk8aj Xk — e X yz8oy x z T e X zy8a z Xy — 8otyZ 8ot z y. The 
z th components of equation (4.27) is 

^2 £ijk.8cxj Xk =i '^2,8aj[Jj,x i \. (4.29) 

jk j 

Since this equation holds for arbitrary 5a, we conclude that the position 
and angular momentum operators Xi and J j must satisfy the commutation 
relation 

[Jij Xj ] — i ^ [ Cijk Xk ■ 
k 

In particular, [J x ,y] = i- and [J z ,x\ = i y, while [J x , x\ = 0. 


(4.27) 

(4.28) 


(4.30) 





66 


Chapter 4: Transformations & Observables 


In fact, if the expectation value of any operator v is a spatial vector, 
then the argument just given in the case of x shows that the components Vi 
must satisfy 

[Ji,Vj\ = iy ^CjjkVk- (4.31) 

k 

For example, since momentum is a vector, equation (4.31) with v = p gives 
the commutation relations of p with J, 

[*A, Pj ] i y ] tjjkPk• (4.32) 

fc 

The product ol • J must be invariant under coordinate rotations because 
the operator U(at) = e -1 “' J depends on the direction a and not on the 
numbers used to quantify that direction. Since a is an arbitrary vector, the 
invariance of ol ■ J under rotations implies that under rotations the compo¬ 
nents of J transform like those of a vector. Hence, in equation (4.31) we can 
replace v by J to obtain the commutation relation 

[./,. rJj ^ = i ^ ' CjjkJk • (4.33) 

k 

In §7.1 shall deduce the spectrum of the angular-momentum operators from 
this relation. 

We now show that J commutes with any operator S whose expectation 
value is a scalar. The proof is simple: (ip\S\il>), being a scalar, is not affected 
by rotations, so 


W\SW) = (i/j\UH a )SU( a M) = (iP\S\i>). (4.34) 

Equating the operators on either side of the second equality and using U~ l = 
U' we have [5, U] = 0. Restricting U to an infinitesimal rotation gives 

S' ~ (1 + i<Sa • J) S (1 - i<5a • J) = S + ifa • [J, S] + O(da) 2 . (4.35) 

Since 8a is arbitrary, it follows that 

[J, S] = 0. (4.36) 

Among other things, this tells us that [J, x • x] = [J, p p] = [J, x • p] =0. It 
is straightforward to check that these results are consistent with the vector 
commutation relations (4.31) (Problem 4.1). It also follows that J 2 = J • J 
commutes with all of the J,;, 


[J, J 2 ] = 0. (4.37) 

Equations (4.33) and (4.37) imply that it is possible to find a complete set 
of simultaneous eigenstates for both J 2 and any one component of J (but 
only one). 

The parity operator Under a parity transform, coordinates behave as 
x —» Vx = —x whereas quantum states transform as | ip) —> | ip') = P\ip), so 

-{tjj\x\ip) = V{ip\x\ip) = (V’ , |x|V’ / ) = {ip\P*xP\ip), (4.38) 

which implies that P^xP = —x or, since P is a unitary operator, 

{x, P} = xP + Px = 0. (4.39) 

Two operators A and B for which { A , B} = 0 are said to anticommute, 
with { A , B} being their anticommutator. The argument we have just given 
for x works with x replaced by any vector operator v, so we always have 


{v, P } = vP + Pv = 0. 


(4.40) 
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This relation contains important information about the action of P. Suppose 
|w) is an eigenstate of a vector operator v with eigenvalues uj such that 
v|u>) = From equation (4.40) we see that 

via;') = v (P|u;)) = — Pv|u>) = — u>P\u>) = — u>\ui') (4-41) 

so the parity-reversed state |u/} = P\iJ) is also an eigenstate of v, but the 
eigenvalue has changed sign. 

Let |±) be states of definite parity such that P|±) = ±|±). With 
equation (4.40) we deduce that 

-<±|v|±> = P<±|V|±) = (±|ptvP|±) = (±) 2 (±|v|±). (4.42) 

Since zero is the only number that is equal to minus itself, all vector operators 
have vanishing expectation value in states of definite parity. More generally, 
if \(j>) and |y) both have the same definite parity, equation (4.40) implies that 
( < /’l v lx) = 0- We’ll use this result in Chapter 9. 

We frequently encounter situations in which the potential energy V (x) 
is an even function of x: V(—x) = H(x). We then say that the potential is 
reflection-symmetric because the potential energy at —x is the same as it 
is at the point x into which —x is mapped by reflection through the origin. 
We now show that in such a case the parity operator commutes with the 
Hamiltonian. For an arbitrary state | ip) consider the amplitude 

(x|PV|V’} = (—x|V|^>) = V(—x)(—x| ip) = V(x)(—x|'i/’}, (4.43a) 

where we have used equation (4.16). On the other hand 

(x.\V P\i/>) = V(x.)(x\P\ip) = V(x)(—x| ip). (4.43b) 

Since x and | ip) are arbitrary, it follows that when V is an even function of 
x, [P, V] = 0. This argument generalises to all operators that carry out a 
transformation that is a symmetry of the potential energy. 

Since the momentum p is a vector operator, Pp = —pP, so 


p 2 P = y^PkPkP = - ^PkPpk = ^2 PpkPk = Pp 2 

k k k 

=► \p\p] = o. 


(4.44) 


Applying these results to the Hamiltonian H = p 2 /2m + V(x) of a particle 
of mass m that moves in a reflection-symmetric potential, we have that 
[H, P] = 0. It follows that for such a particle there is a complete set of 
stationary states of well-defined parity. This fact is illustrated by the case 
of the harmonic oscillator studied in §3.1, and in Chapter 5 it will enable us 
dramatically to simplify our calculations. 

In classical physics, a vector product ax b is a pseudovector; it behaves 
like an ordinary vector under rotations, but is invariant under parity, since 
both a and b change sign. We now show that expectation values of the 
angular momentum operators, (J), are pseudovectors. If u* are components 
of a vector operator, then combining equations (4.31) and (4.40), we obtain 

{P, [vi,Jj]} = i 5> ifc {P,u fc } = 0. (4.45) 

k 

We use the identity (4.76) proved in Problem 4.8 to rewrite the left side of 
this equation. We obtain 


0 = {P, [vi, J,]} = [{P, J/} - {[P, Jj],Vi} = -{[P, Jj],Vi}. (4.46) 
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Hence the operator [P, Jj] anticommutes with any component of an arbitrary 
vector. Since P is defined to have precisely this property, [P, Jj] must be 
proportional to P, that is 

[P,Jj] = AP, (4.47) 

where A must be the same for all values of j because the three coordinate 
directions are equivalent. Under rotations, the left side transforms like a vec¬ 
tor, while the right side is invariant. This is possible only if both sides vanish. 
Hence the parity operator commutes with all three angular-momentum op¬ 
erators. It now follows that 

w\m) = = mm, (4.4s) 

so (J) is unchanged by a parity transformation, and is a pseudovector. 
Mirror operators In §4.1.4 we introduced a typical mirror operator M. 
To discover how M interacts with the position operators x and y we argue 
that for any state | ip) 


{ip\M^xM\ip) = {il>\y\ip). (4.49) 

That is, in the state M\tp) the expectation of x must be equal to the ex¬ 
pectation value of y in the state \if>) - the truth of this statement follows 
immediately from the definition (4.19) of the state M\ip). Since equation 
(4.49) holds for arbitrary | ip), we can infer the operator equation 

M^xM = y => xM = My, (4.50) 

where the second equation follows by multiplying both sides of the first equa¬ 
tion by M and using the unitarity condition MM' = I. In the same way we 
can show that Mx = yM , p x M = Mp y and p y M = Mp x . 


4.3 Symmetries and conservation laws 

Time changes states: in a given time interval t, the natural evolution of the 
system causes any state |^>, 0) to evolve to another state | ip,t). Equation 
(2.32) gives an explicit expression for | ip,t). It is easy to see that with the 
present notation this rule can be written 

foM>=e" iiH / R hM>, (4.51) 

where H is the Hamiltonian. The time-evolution operator 

U(t) = e - iHt/h (4.52) 

is unitary, as we would expect. 3 

Now suppose that the generator r of some displacement (a translation, 
a rotation, or something similar) commutes with H. Since these operators 
commute, their exponentials U(6) (eq. 4.12) and U(t) also commute. Con¬ 
sequently, for any state \ip) 

U{0)U{tm = U(t)U(9M- (4.53) 


3 The similarity between equations (4.52) and the formula (4.12) for a general unitary 
transformation suggests that H is the generator of transformations in time. This is not 
quite true. If we were to push the system forward in time in the same way that we 
translate it in x, we would delay the instant at which we would impose some given initial 
conditions, with the result that it would be less evolved at a given time t. The time- 
evolution operator, by contrast, makes the system older. Hence H is the generator of 
transformations backwards in time. 
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The left side is the state you get by waiting for time t and then displacing, 
while the right side is the state obtained by displacing first and then waiting. 
So the equation says that the system evolves in the same way no matter where 
you put it. That is, there is a connection between commuting observables and 
invariance of the physics under displacements. Moreover, in §2.2.1 we saw 
that when any operator Q commutes with the Hamiltonian, the expectation 
value of any function of Q is a conserved quantity, and that in consequence, 
a system that is initially in an eigenstate \qi) of Q remains in that eigenstate. 
So whenever the physics is unchanged by a displacement, there is a conserved 
quant ity. 

If [p x , H] = 0, this argument implies that the system evolves in the same 
way wherever it is located. We say that the Hamiltonian is translationally 
invariant. It is a fundamental premise of physics that empty space is the same 
everywhere, so the Hamiltonian of every isolated system is translationally 
invariant. Consequently, when a system is isolated, the expectation value of 
any function of the momentum operators is a conserved quantity, and, if the 
system is started in a state of well-defined momentum, it will stay in that 
state. This is Newton’s first law. 

If [J Zl H] = 0, we say that the Hamiltonian is rotationally invariant 
around the 2 axis, and our argument implies that the system evolves in the 
same way no matter how it is turned around the 2 axis. The expectation 
value of any function of J z is constant, and if the state is initially in an 
eigenstate of J z with eigenvalue m, it will remain in that state. Consequently, 
m is a good quantum number. In classical physics invariance of a system’s 
dynamics under rotations around the ^ axis is associated with conservation 
of the 2 component of the system’s angular momentum. This fact inspires 
the identification of hJ with angular momentum. 

Above we used a very general argument to infer that the existence of 
a unitary operator that commutes with the Hamiltonian implies that the 
system has a symmetry. In §4.1.4 an explicit calculation (eq. 4.43) showed 
that reflection symmetry of the potential energy implied that the potential- 
energy operator V commutes with the parity operator P. This argument 
generalises to other transformation operators. For example, suppose V (x) is 
invariant under some rotation V(R(a:)x) = V(x). Then 

{x\VU (a)\ip) = V(x)(x\U(a)\ip) = V r (x)(R(a)x|'!/>), (4.54a) 

while 

(x\U (a)H|V ; ) = (R(a)x|V|^>) = V(R(a)x)(R(a)x| , 0), (4.54b) 


so and the operator equation UV = VU follows from the equality of V (R(a)x) 
and V(x). 

In general, finding all the operators that commute with a given Hamilto¬ 
nian is a very difficult problem. However, it is sometimes possible to deduce 
conserved quantities by direct inspection. For example, the Hamiltonian for 
a system of n particles that interact with each other, but not with anything 
else, is 


h = J2 



i<j 


(4.55) 


where the potential-energy function V only depends on the relative positions 
of the individual particles. Such a Hamiltonian is invariant under translations 
of all particles together (shifts of the centre of mass coordinate) and thus the 
total momentum p tot = JA p.j of this system is conserved. 

If the Hamiltonian is a scalar, then [H, J] = 0, [H, J 2 ] = 0 and [if, P] = 0 
(Problem 4.10), which implies conservation of angular momentum around 
any axis, conservation of total angular momentum, and conservation of par¬ 
ity. We have already seen that [J, J 2 ] = [J, P] =0, so for a scalar Hamilto¬ 
nian we can find complete sets of simultaneous eigenkets of if, P , J 2 , and 
any one of the components of J. 
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The equation [H, P] = 0 implies that if you set up a system that at 
t = 0 is a mirror image of a given system, it will evolve in exactly the same 
way as the given system. When the evolution of the mirrored system is 
watched, it will appear identical to the evolution of the given system when 
the latter is observed in a mirror. Hence, when [H,P] = 0, it is impossible 
to tell whether a system that is being observed, is being watched directly or 
through a mirror. One of the major surprises of 20 th century physics was an 
experiment by Wu et al. 4 in 1957, which showed that you can see things in a 
mirror that cannot happen in the real world! That is, there are Hamiltonians 
for which [ H , P] ^ 0. 


4.4 The Heisenberg picture 

All physical predictions are extracted from the formalism of quantum me¬ 
chanics by operating with a bra on a ket to extract a complex number: we 
either calculate the amplitude for some event as A = (<f>\i[)) or the expecta¬ 
tion value of a observable through (Q) = {if>\Qip), where \Qtp) = Q\ip)- In 
general our predictions are time-dependent because the state of our system 
evolves in time according to 


\tp,t) = U(t)\ip,0), (4.56) 

where the time-evolution operator U is defined by equation (4.52). 

With every operator of interest we can associate a new time-dependent 
operator 

Qt = U\t)QU{i). (4.57) 

Then at any time t the expectation value of Q can be written 

(Q) t = = (ip,0\U\t)QU{t)\il},0) = (V>,0|<2t|^,0). (4.58) 

That is, the expectation value at time t = 0 of the new operator Q t is 
equal to the expectation value of the original, physical, operator Q at time t. 
Similarly, when we wish to calculate an amplitude {(j>,t\‘ip,t) for something 
to happen at time t, we can argue that on account of the unitarity of U (t) 
it is equal to a corresponding amplitude at time zero: 

((f>, t\ip, t) = (</>, 0\ip, 0) where |</>, t) = U(t)\<f>, 0). (4.59) 


Thus if we work with the new time-dependent operators such as Qt, the 
only states we require are those at t = 0. This formalism, is called the 
Heisenberg picture to distinguish it from the Schrodinger picture in 
which states evolve and operators are normally time-independent. 

As we have seen, classical mechanics applies in the limit that it is suffi¬ 
cient to calculate the expectation values of observables, and is concerned with 
solving the equations of motion of these expectation values. In the Heisen¬ 
berg picture quantum mechanics is concerned with solving the equations of 
motion of the time-dependent operators Qt, etc. Consequently, there is a 
degree of similarity between the Heisenberg picture and classical mechanics. 

It is straightforward to determine the equation of motion of Q t : we 
simply differentiate equation (4.57) 


dQ t 

df 


dW 

dt 


QU + U^Q—. 

dt 


(4.60) 


4 Wu, C.S., Ambler, E., Hayward, R.W., Hoppes, D.D. & Hudson, R.P., 1957, Phys. 
Rev., 105, 1413 
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But differentiating equation (4.52) we have 


dU 

df 



d t h ’ 


(4.61) 


where we have taken advantage of the fact that U is a function of H and 
therefore commutes with it. Inserting these expressions into equation (4.60) 
we obtain __ 

ir 'Vr = -»V'QU + U'QUH (462) 

= [Qt,H\. 

This result is similar to Ehrenfest’s theorem (eq. 2.34) as it has to be because 
Ehrenfest’s theorem must be recovered if we pre- and post-multiply each side 
by the time-independent state \ip, 0). 

The Heisenberg picture is most widely used in the quantum theory of 
fields. In this theory one needs essentially only one state, the vacuum in the 
remote past 10), which we assume was empty. Excitations of the vacuum 
are interpreted as particles, each mode of excitation being associated with a 
different type of particle (photons, electron, up-quarks, etc). The theory is 
concerned with the dynamics of operators that excite the vacuum, creating 
particles, which then propagate to other locations, where they are detected 
(annihilated) by similar operators. Sometimes one mode of excitation of the 
vacuum morphs into one or more different modes of the vacuum, and such an 
event is interpreted as the decay of one type of particle into other particles. 
The amplitude for any such sequence of events is obtained as a number of the 
form (1)\AiA2 ... A n \Q), where the operators Ai are creation or annihilation 
operators for the appropriate particles in the Heisenberg picture. 


4.5 What is the essence of quantum mechanics? 

It is sometimes said that commutation relations such as [xi,pj\ = i hSij and 
[Ji, Jj] = CijkJk are inherently quantum mechanical, but this is not true. 

Take for example an ordinary classical rotation matrix R(a) which ro¬ 
tates spatial vectors as v —> v' = R(a)v. Define matrices J x , J y and J z 
via 

exp (—ia • J) = R(a), (4.63) 

where the exponential of a matrix is defined in terms of the power series for 
e x . Clearly, the Ji must be 3 x 3 matrices, and, since R(a) is real and the 
angles a are arbitrary, the Ji must be pure imaginary. Finally, orthogonality 
of R requires 


I = R T (a)R(a) = exp(— ia • J) T exp(— ia • J) 

T (4.64) 

= exp(—ia ■ J ) exp(—ia ■ J) 

We express a in terms of the angle of the rotation it represents, 6 = |a|, 
and the direction n = a/|a| of the rotation axis, and then we differentiate 
equation (4.64) with respect to 9. We obtain 

0 = —in ■ J T exp(—i#n ■ J v ) exp(—i#n ■ J) 

+ exp(— \9n ■ J T ) exp(— \9n ■ J ){—in ■ J) (4.65) 

= -in -{J T A J}. 

Since n is an arbitrary unit vector, it now follows that J^ = — Ji, so Ji 
is antisymmetric. A pure imaginary antisymmetric matrix is a Hermitian 
matrix. Thus the J7) are Hermitian. 



72 


Chapter 4: Transformations & Observables 


For any two vectors a and /3, it is easy to show that the product 
R t (c*)R(/3)R(a:) is an orthogonal matrix with determinant +1, so it is a 
rotation matrix. It leaves the vector 0 = R(— ot)/3 = R T (a)/3 invariant: 

jR T (Q0R(/3)R(Q0}/3' = R T (a)R(/3)/3 = R T (a)/3 = 0. (4.66) 

Hence 0 is the axis of this rotation. Therefore 

R t (q:)R(/3)R(qO = R(/3') = R(R(-a)/3). (4.67) 

In Box 4.3 we showed that when |a| is infinitesimal, R(— a)/3 ~ (3 — ot x (3, 
so when [3 is also infinitesimal, equation (4.67) can be written in terms of 
the classical generators (4.63) as 

(1 + ia • J) (1 - i/3 • J) (1 - ia ■ J) ~ 1 - i(/3 - a x 0) ■ J. (4.68) 

The zeroth order terms (‘1’) and those involving only a or (3 cancel, but the 
terms involving both a and (3 cancel only if 

tijkJk- (4.69) 

k 

This equation can hold for all directions a and (3 only if the Ji satisfy 

[JuJj] = i£ eijkfJkt (4.70) 

k 

which is identical to the ‘quantum’ commutation relation (4.33). Our red¬ 
erivation of these commutation relations from entirely classical considerations 
is possible because the relations reflect the fact that the order in which you 
rotate an object around two different axes matters (Problem 4.6). This is 
a statement about the geometry of space that has to be recognised by both 
quantum and classical mechanics. 

In Appendix D it is shown that in classical statistical mechanics, each 
component of position, Xi, and momentum, Pi, is associated with a Herrni- 
tian operator Xi or pi that acts on functions on phase space. The operator 
Pi generates translations along Xi, while Xi generates translations along p % 
(boosts). The operators Li associated with angular momentum satisfy the 
commutation relation [L Xl L y \ = iHL z , where LL is a number with the same 
dimensions as Ti and a magnitude that depends on how x t and pt are nor¬ 
malised. 

If the form of the commutation relations is not special to quantum me¬ 
chanics, what is? In quantum mechanics, complete information about any 
system is contained in its ket \ip). There is nothing else. From 1 0) we can 
evaluate amplitudes such as (x, p\ip) for the system to be found at x with 
orientation p. If we do not care about /x, the total probability for | ip) to be 
found at x is 

Prob(at x.\ip) = ^|(x, yx|'i/>)| 2 . (4-71) 

Eigenstates of the x operator with eigenvalue Xo are states in which the 
system is definitely at xo, while eigenstates of the p operator with eigenvalue 
Tik. are states in which the system definitely has momentum hk. 

By contrast, in classical statistical mechanics we declare at the outset 
that a well defined state is one that has definite values for all measurable 
quantities, so it has a definite position, momentum, orientation etc. The 
eigenfunctions of p or L do not represent states of definite momentum or 
angular momentum, because we have already defined what such states are. 

Classical statistical mechanics knows nothing about probability ampli¬ 
tudes, but interprets the functions on phase space on which p or L act 
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as probability distributions. This is possible because, as we show in Ap¬ 
pendix D, the integral of such a distribution can be normalised to one and is 
conserved. We can certainly expand any such distribution in the eigenfunc¬ 
tions of, say p. However, as in quantum mechanics the expansion coefficients 
will not be positive - in fact, they will generally be complex. Hence they 
cannot be interpreted as probabilities. What makes quantum mechanics fun¬ 
damentally different is its reliance on complex quantum amplitudes, and the 
physical interpretation that it gives to a functional expansion through the 
fundamental rule (1.11) for adding quantum amplitudes. Quantum mechan¬ 
ics is therefore naturally formulated in terms of states | ip) that inhabit a 
complex vector space of arbitrary dimension - a so called Hilbert space. 
These states may always be expanded in terms of a complete set of eigen¬ 
states of a Hermitian operator, and the (complex) expansion coefficients have 
a simple physical interpretation. 

Classical statistical mechanics is restricted to probabilities, which have 
to be real, non-negative numbers and are therefore never expansion coeffi¬ 
cients. Quantum and classical mechanics incorporate the same commutation 
relations, however, because, as we stressed in §4.2, these follow from the 
geometry of space. From a mathematician’s perspective, the commutation 
relations of quantum-mechanical operators and the operators of classical sta¬ 
tistical physics have to be the same because both systems of operators pro¬ 
vide representations of the ‘Lie algebra’ of the same mathematical group 
(Appendix E). 

Problems 

4.1 Verify that [J,x • x] =0 and [J,x ■ p] = 0 by using the commutation 
relations [x,, Jj\ =i J2k e ijk x k and \pi,Jj] =i J2k e ijkPk- 

4.2* Show that the vector product a x b of two classical vectors transforms 
like a vector under rotations. Hint: A rotation matrix R satisfies the relations 
R • R t = I and clet(R) = 1, which in tensor notation read RipRtp = Sit 
and eijkRirRjsRkt — e r st- 

4.3* We have shown that [v-i , .J 3 } = i e^k'Ok for any operator whose 
components u, form a vector. The expectation value of this operator relation 
in any state \4>) is then {ip\[vi, Jj]\ip) = i Sfc e iife(V , K’fe|V’)- Check that with 
U{ol ) = e -1 “' J this relation is consistent under a further rotation \ip) —> 
\tp') = U(a)\ i/j) by evaluating both sides separately. 

4.4* The matrix for rotating an ordinary vector by (j) around the 2 axis is 

( cos (j) —sin cj) 0 \ 

sin^) cos </> 0 J (4.72) 

0 0 l) 

By considering the form taken by R for infinitesimal cf> calculate from R the 
matrix J z that appears in R(c/>) = exp(—iIntroduce new coordinates 
Mi = (—x + i y)/y/2, U 2 = z and 113 = (x + iy)/^/2. Write down the matrix M 

that appears in u = M ■ x [where x = (x, y, z)\ and show that it is unitary. 

Then show that 

Jl = M J z M*. (4.73) 

is identical with S z in the set of spin-one Pauli analogues 


1 /° f 0 

S* = -jz 1 0 1 

v 2 \0 1 0 


! /0 —i 0 
Sy = -p: i 0 -i 
\ n i n 


/I 0 0 

s z = ( 0 0 0 
V 0 0 -1 


(4.74) 

Write down the matrix 3, whose exponential generates rotations around 
the x axis, calculate J' x by analogy with equation (4.73) and check that 
your result agrees with S x in the set (4.74). Explain as fully as you can the 
meaning of these calculations. 
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4.5 Determine the commutator \ J' X - J' z \ of the generators used in Problem 
4.4. Show that it is equal to —i J', where J' y is identical with S y in the set 
(4.74). 

4.6* Show that if a. and (3 are non-parallel vectors, a is not invariant under 
the combined rotation R(c*)R(/3). Hence show that R 1 (/3)R T (ct)R(/3)R(a) 
is not the identity operation. Explain the physical significance of this result. 

4.7* In this problem you derive the wavefunction 

(x|p) = e ip ' x/n (4.75) 

of a state of well defined momentum from the properties of the translation op¬ 
erator 17(a). The state |k) is one of well-defined momentum Tik.. How would 
you characterise the state |k') = U(a)|k)? Show that the wavefunctions of 
these states are related by itk'(x) = e _lak tik(x) and Wk'( x ) = rtk(x — a). 
Hence obtain equation (4.75). 

4.8 By expanding the anticommutator on the left and then applying the 
third rule of the set (2.22), show that any three operators satisfy the identity 

[{A, B}, C] = {A, [B, C\} + {[71, C\,B}. (4.76) 


4.9 Define G in terms of the parity operator P by 

G = 7(1 — P). (4.77) 

Show that G is Hermitian and that G n = G for positive integer n. Explain 
this result in terms of the eigenkets and eigenvalues of G. Show further that 
P = U(n) where U(s) = e lsG . 

4.10 Let P be the parity operator and S an arbitrary scalar operator. 
Explain why P and S must commute. 

4.11 In this problem we consider discrete transformations other than that 
associated with parity. Let S be a linear transformation on ordinary three- 
dimensional space that effects a a reflection in a plane. Let S be the asso¬ 
ciated operator on kets. Explain the physical relationship between the kets 
| if>) and | ip') = S \ip). Explain why we can write 

S(ip\x\ip) = ('0|S , ' f xS'[0). (4.78) 

What are the possible eigenvalues of S'? 

Given that S reflects in the plane through the origin with unit normal 
n, show, by means of a diagram or otherwise, that its matrix is given by 

Sij = $ij ~ 2 riirij. (4.79) 

Determine the form of this matrix in the case that n = (1, —1, 0)/y/2. Show 
that in this case Sx = yS and give an alternative expression for Sy. 

Show that a potential of the form 


H(x) = f(R) + A xy, where R = \J x 1 + y 2 (4.80) 

satisfies V (Sx) = V (x) and explain the geometrical significance of this equa¬ 
tion. Show that [S, V] = 0. Given that E is an eigenvalue of H = p 2 /2m + V 
that has a unique eigenket \E), what equation does | E) satisfy in addition 
toH\E)=E\E)7 

4.12 Show that the operator defined by (x,y\S\4>) = (y,x\ip) is Hermitian. 
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We follow up our study of the harmonic oscillator by looking at motion in 
a wider range of one-dimensional potentials V(x). The potentials we study 
will be artificial in that they will only vary in sharp steps, but they will 
enable us to explore analytically some features of quantum mechanics that 
are generic and hidden from us in the classical limit. We start by considering 
a particle that is trapped in a potential well and go on to consider a particle 
that has a choice of two wells. We find that in this case it can move between 
these wells in violation of classical mechanics, and we use this simple system 
to mode the operation of an ammonia maser. In §5.3 we ask how potential 
wells and barriers affect the motion of a free particle - one that can escape 
to infinity. We find that whereas in classical mechanics the particle is never 
reflected by a potential well, in quantum mechanics there is generally a non¬ 
zero amplitude for such reflection. We find also that particles can “tunnel” 
through barriers that classically would certainly reflect them. 


5.1 Square potential well 


We look for energy eigenstates of a particle that moves in the potential 
(Figure 5.1) 


V(x) 


0 for |x| < a 

Vo > 0 otherwise. 


(5.1) 


Since V is an even function of x, the Hamiltonian (2.51) commutes with the 
parity operator P (page 67). So there is a complete set of energy eigenstates 
of well defined parity. The wavefunctions u(x) = (x\E) of these states will 
be either even or odd functions of x , and this fact will greatly simplify the 
job of determining u{x). 

In the position representation, the governing equation (the tise 2.33) 
reads 

f, 2 d 2 U 

-^^>+ V W u = Eu - ( 5 - 2 ) 


On account of the step-like nature of V, equation (5.2) reduces to a pair of 
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x/a 

Figure 5.1 The dotted line shows the square-well potential V(x). The full curve shows 
the ground-state wavefunction. 


extremely simple equations, 

d 2 u 2 mE 

dx 2 h 2 

d 2 u 2m(Vo — E) 
dx 2 h 2 

We restrict ourselves to solutions that describe a particle that is bound by 
the potential well in the sense that E < Vq. 1 Then the solution to the second 
equation is u(x ) = Ae ±Kx , where A is a constant and 


for \x\ < a 


otherwise. 


(5.3) 


K = 


l2m(V 0 -E) 


(5.4) 


If u is to be normalisable, it must vanish as \x\ —> oo. So at x > a we have 
u(x) = Ae Kx , and at x < —a we have u(x) = ±Ae +Kx , where the plus 
sign is required for solutions of even parity, and the minus sign is required 
for odd parity. 

For E > 0, the solution to the first of equations (5.3) is either u(x) = 
B cos (kx) or u(x) = B sin (kx) depending on the parity, where 


k = 


2 mE 


(5.5) 


So far we have ensured that u(x) solves the tise everywhere except 
at |ar| = a. Unless u is continuous at these points, dit/d.T will be arbitrarily 
large, and d 2 i(/d.x 2 will be undefined, so u will not satisfy the tise. Similarly, 
unless du/dx is continuous at these points, d 2 u/dx 2 will be arbitrarily large, 
so u cannot solve the tise. Therefore, we require that both u and dit/dx are 
continuous at x = a, that is 


B cos (ka) = Ae Ka 
—kB sin(fca) = -KAe~ Ka 


or 


B sin(fca) = Ae~ Ka 
kBcos(ka) = —KAe~ Ka 


(5.6) 


where the first pair of equations apply in the case of even parity and the 
second in the case of odd parity. It is easy to show that once these equa¬ 
tions have been satisfied, the corresponding equations for x = — a will be 
automatically satisfied. 

1 By considering the behaviour of u near the origin we can prove that E > 0. 
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Figure 5.2 Plots of the left (full) and right (dashed) sides of equation (5.8) for the case 
VP = 10. 



We eliminate A and B from equations (5.6) by dividing the second 
equation in each set by the first. In the case of even parity we obtain 

fctan(fca) = K = ^ ° — k 2 . (5.7) 

This is an algebraic equation for k, which controls E through (5.5). Before 
attempting to solve this equation, it is useful to rewrite it as 

tan(fco) = -v/rr-Ta — 1 where w = \l (5-8) 
y {ka) 2 V h 

W and ka are dimensionless variables. The left and right sides of equation 
(5.8) are plotted as functions of ka in Figure 5.2. Since for ka = 0 the graphs 
of the two sides start at the origin and infinity, and the graph of the left side 
increases to infinity at ka = 7t/2 while the graph of the left side terminates 
at ka = W, the equation always has at least one solution. Thus no matter 
how small Vo and a are, the square well can always trap the particle. The 
bigger W is, the more solutions the equation has; a second solution appears 
at W = 7r, a third at IT = 27 t, etc. 

Analogously one can show that for an odd-parity energy eigenstate to 
exist, we must have W > 7r/2 and that additional solutions appear when 
W = (2 r + l)7r/2 for r = 1, 2,... (Problem 5.5). 

From a naive perspective our discovery that no matter how narrow or 
shallow it is, a square potential well always has at least one bound state, 
conflicts with the uncertainty principle: the particle’s momentum cannot 
exceed p max = V / 2 rnE < ^2mVo and can have either sign, so if the particle 
were confined within the well, the product of the uncertainties in p and x 
would be less than 4ap max < 4-\/2mVoa 2 = 4 hW, which tends to zero with 
W. The resolution of this apparent paradox is that for W <C 1 the particle is 
not confined within the well; there is a non-negligible probability of finding 
the particle in the classically forbidden region \x\ > a. In the limit W —> 0 
the particle is confined by a well in which it is certain never to be found! 
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Figure 5.4 The wavefunctions of the lowest three stationary states of the infinitely deep 
square well: ground state (full); first excited state (dashed); second excited state (dotted). 


Our result that a square well always has a bound state can be extended 
to potential wells of any shape: given the potential well U sketched in Fig¬ 
ure 5.3, we consider the square well shown by the dashed line in the figure. 
Since this shallower and narrower well has a bound state, we infer that the 
potential U also has at least one bound state. 


5.1.1 Limiting cases 

(a) Infinitely deep well It is worthwhile to investigate the behaviour 
of these solutions as Vo —> oo with a fixed, when the well becomes infinitely 
deep. Then W —> oo and the dashed curve in Figure 5.2 moves higher 
and higher up the paper and hits the x axis further and further to the 
right. Consequently, the values of ka that solve equation (5.8) tend towards 
ka = (2 r + l)7r/2, so the even-parity energy eigenfunctions become 


* 0 ) = { 


_ f Acos[(2r + l)7nr/2a] 


0 


\x\ < a 
otherwise. 


(5.9) 


This solution has a discontinuity in its gradient at x = a because it is the 
limit of solutions in which the curvature K for x > a diverges to infinity. The 
odd-parity solutions are obtained by replacing the cosine with sin(s7ra;/a), 
where s = 1,2,..., which again vanish at the edge of the well (Figure 5.4). 
From this example we infer the principle that wavefunctions vanish at the 
edges of regions of infinite potential energy. 

The energy of any stationary state of an infinite square potential well 
can be obtained from 


n“ f fnt\ 


E n — -— — , where n = 1,2,... 

8?7i \ a ) 


(5.10) 


The particle’s momentum when it is in the ground state (n = 1) is of order 
fik = hit/ 2 a and of undetermined sign, so the uncertainty in the momentum 
is A p ~ fi.it/a. The uncertainty in the particle’s position is ~ 2a, so 
A x A p ~ 2hit, consistent with the uncertainty principle (§2.3.2). 

(b) Infinitely narrow well In §11.5.1 we will study a model of covalent 
bonding that involves the potential obtained by letting the width of the 
square well tend to zero as the well becomes deeper and deeper in such a 
way that the product Vo a remains constant. In this limit W oc ay/Vo (eq. 5.8) 
tends to zero, so there is only one bound state and it will be an even-parity 
state. 

Rather than obtaining the wavefunction and energy of this state from 
formulae already in hand, it is more convenient to reformulate the problem 
using a different normalisation for the energy: we now set V to zero outside 
the well, so V becomes negative at interior points. Then we can write V (x) = 
— VgS(x), where S(x) is the Dirac delta function and Vs > 0. The tise now 
reads 

fi 2 d 2 u 

2 m da: 2 


VgS(x)u = Eu. 


(5.11) 
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Figure 5.5 Wavefunction of a particle trapped by a very narrow, deep potential well. 


Integrating the equation from x = — e to x = e with e infinitesimal, we find 

(5.12) 


d u 
dx 


= ^V^u(O) + E J d xii'j . 


Since u is finite, the integral on the right can be made as small as we please 
by letting e —> 0. Hence the content of equation (5.12) is that du/dx has a 
discontinuity at the origin: 



2 mVg 

IT 


u{ o). 


(5.13) 


Since we know that the solution we seek has even parity, it is of the 
form u(x) = Ae^ Kx , where the minus sign applies for x > 0 (Figure 5.5). 
Substituting this form of u into (5.13) and dividing through by 2 A we have 


K = 


mV s 

h 2 


(5.14) 


Inserting u = e Kx into equation (5.11) at x > 0 we find that E = —h 2 K 2 /2m, 
so the energy of a particle that is bound to a (5-function potential is 


E = — 


mV 2 
2h 2 ■ 


(5.15) 


Figure 5.5 shows that (^(ar) | 2 is finite in the well, and the well in infinitely 
narrow, so the probability of finding the particle in the well is zero - the 
particle is certain never to be in the well that traps it! This result is an 
extreme case of the phenomenon we discussed apropos the application of the 
uncertainty principle to a shallow well of finite depth. 


5.2 A pair of square wells 

Some important phenomena can be illustrated by considering motion in a 
pair of potentials that are separated by a barrier of finite height and width. 
Figure 5.6 shows the potential 


(V 0 

for \x\ < a 

o 

II 

(7T 

for a < \x\ < b 

v oo 

otherwise. 


Since the potential is an even function of x, we may assume that the energy 
eigenfunctions that we seek are of well-defined parity. 

For simplicity we take the potential to be infinite for jar| > 5, and we 
assume that the particle is classically forbidden in the region \x\ < a. Then 
in this region the wavefunction must be of the form u(x) = A cosh(.ftT:r) 
or u(x) = Hsinh(.ftT;r) depending on parity, and K is given by (5.4). In 
the region a < x < b the wavefunction may be taken to be of the form 
u(x) = B sin (/ex + (f>), where B , and (f) are constants to be determined and k 
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Figure 5.6 A double potential well with 6/a = 5. 



Figure 5.7 Full curves: the left side of equation (5.19) for the case W = 3.5, 6 = 5a. 
Each vertical section is associated with a different value of the integer r. The right side is 
shown by the dotted curve for even parity, and the dashed curve for odd parity. 


is related to the energy by (5.5). From our study of a single square well we 
know that u must vanish at x = 6, so 


sin(fcfe+ </>)= 0 => </> = m — kb with r = 0,1,... (5-17) 


Again by analogy with the case of a single square well, we require u and its 
derivative to be continuous at x = a. so (depending on parity) 


cosh(A'a) = B sin(fca + <f>) 1 f sinh(A'a) = B sin (ka + <j>) 


K sinh(A'a) = kB cos (ka + (/>)) i K cosh(A'a) = kB cos (ka + <f>). 

(5- 18 ) 

Once these equations have been solved, the corresponding conditions at 
x = —a will be automatically satisfied if for —b<x< —a we take u = 
±Bsin(fc|a;| + <f>), using the plus sign in the even-parity case. 

Using (5.17) to eliminate (f> from equations (5.18) and then proceeding 
in close analogy with the working below equations (5.6), we find 


tan [r7r 


k{b — a)] 



cotli ( \JW 2 — (/ca) 2 ) 
tanh ( sJW 2 — (fca) 2 ) 


even parity 

odd parity, 

(5.19) 


where W is defined by equation (5.8). 
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Figure 5.8 The ground state (full curve) and the associated odd-parity state (dashed 
curve) of the double square-well potential (shown dotted). 


The left and right sides of equation (5.19) are plotted in Figure 5.7; the 
values of ka for stationary states correspond to intersections of the steeply 
sloping curves of the left side with the initially horizontal curves of the right 
side. The smallest value of ka is associated with the ground state. The values 
come in pairs, one for an even-parity state, and one very slightly larger for 
an odd-parity state. The difference between the k values in a pair increases 
with k. 

The closeness of the k values in a given pair ensures that in the right- 
hand well (a < x < b) the wavefunctions u e (x) and u 0 (x) of the even- and 
odd-parity states are very similar, and that in the left-hand well u e and u 0 
differ by little more than sign - see Figure 5.8. Moreover, when the k values 
are similar, the amplitude of the wavefunction is small in the classically 
forbidden region |x| < a. Hence, the linear combinations 


i>±{x) = [«e(») ± u 0 {x)\ 


(5.20) 


are the wavefunctions of a state |t/>+) in which the particle is almost certain 
to be in the right-hand well, and a state \ip~) in which it is equally certain 
to be in the left-hand well. 

Consider now how the system evolves if at time 0 it is in the state \ip+), 
so the particle is in the right-hand well. Then by equation (2.32) at time t 
its wavefunction is 


i/j{x,t) 


1 

72 L 

-iEet/h 

V* 


u e {x)e-' lE ° t/h + u 0 (x)e~ iE ° t/n 

u e (x) + Uo(x)e- i( ~ E °- E °W h 


(5.21) 


After a time T = nTi/{E 0 — E e ) the exponential in the square brackets on the 
second line of this equation equals — 1 , so to within an overall phase factor 
the wavefunction has become [u e (x) — u 0 [x)\/i/ 2 , implying that the particle 
is certainly in the left-hand well; we say that in the interval T the particle has 
tunnelled through the barrier that divides the wells. After a further period 
T it is certainly in the right-hand well, and so on ad infinitum. In classical 
physics the particle would stay in whatever well it was in initially. In fact, the 
position of a familiar light switch is governed by a potential that consists of 
two similar adjacent potential wells, and such switches most definitely do not 
oscillate between their on and off positions. We do not observe tunnelling in 
the classical regime because E 0 — E e decreases with increasing W faster than 
e — 21 V (p r oble m 5.16), so the time required for tunnelling to occur increases 
faster than e 2W and is enormously long for classical systems such as light 
switches. 


5.2.1 Ammonia 

Nature provides us with a beautiful physical realisation of a system with a 
double potential well in the ammonia molecule NH 3 . Ammonia contains four 
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Figure 5.9 The two possible relative locations of nitrogen and hydrogen atoms in NH 3 . 


nuclei and ten electrons, so is really a very complicated dynamical system. 
However, in §11.5.2 we shall show that a useful way of thinking about the 
low-energy behaviour of molecules is to imagine that the electrons provide 
light springs, which hold the nuclei together. The nuclei oscillate around the 
equilibrium positions defined by the potential energy of these springs. In 
the case of NH 3 , the potential energy is minimised when the three hydrogen 
atoms are arranged at the vertices of an equilateral triangle, while the ni¬ 
trogen atom lies some distance x away from the plane of the triangle, either 
‘above’ or ‘below’ it (see Figure 5.9). Hence if we were to plot the molecule’s 
potential energy as a function of x, we would obtain a graph that looked like 
Figure 5.6 except that the sides of the wells would be sloping rather than 
straight. This function would yield eigenenergies that came in pairs, as in 
our square-well example. 

In many physical situations the molecule would have so little energy that 
it could have negligible amplitudes to be found in any but the two lowest- 
lying stationary states, and we would obtain an excellent approximation to 
the dynamics of ammonia by including only the amplitudes to be found in 
these two states. We now use Dirac notation to study this dynamics. 

Let |+) be the state whose wavefunction is analogous to the wavefunction 
ip+{x) defined above in the case of the double square well; then ip+(x) = 
(x|+), and in the state |+) the N atom is certainly above the plane containing 
the H atoms. The ket |—) is the complementary state in which the N atom 
lies below the plane of the H atoms. 

The |±) states are linear combinations of the eigenkets |e) and |o) of the 
Hamiltonian: 


l±> 


^(|e)±|o». 


(5.22) 


In the |±) basis the matrix elements of the Hamiltonian H are 


(+|iL|+) — |((e| + (o|)iJ(|e) + |o» — \{E e + E 0 ) 

{+\H\~) = ±«e| + <o|)if(|e> - |o» = \{E e - E 0 ) (5.23) 

= U(*\ ~ <o|)tf (|e> - |o» = \{E e + E 0 ) 


Bearing in mind that H is represented by a Hermitian matrix, we conclude 
that it is _ 

H ={-A "b)> < 5 - 24 > 

where E = \{E e + E 0 ) and A = \(E 0 — E e ) are both positive. 

Now the electronic structure of NH 3 is such that the N atom carries 
a small negative charge — q, with a corresponding positive charge +q dis¬ 
tributed among the H atoms. With NH 3 in either the |+) or |—) state there 
is a net separation of charge, so an ammonia molecule in these states pos¬ 
sesses an electric dipole moment of magnitude qs directed perpendicular to 
the plane of H atoms (see Figure 5.9), where s is a small distance. 

Below equation (5.21) we saw that a molecule that is initially in the 
state |+) will subsequently oscillate between this state and the state |—) at 
a frequency (E 0 — E e )/2n?i = A/ttH. Hence a molecule that starts in the 
state |+) is an oscillating dipole and it will emit electromagnetic radiation 
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Figure 5.10 Energy levels of the 
ammonia molecule as a function of 
external electric field strength 8. 
The quantity plotted, A E = E — E. 


at the frequency A/nh. This proves to be 150 GHz, so the molecule emits 
microwave radiation. 

The ammonia maser The energy 2 A that separates the ground and first 
excited states of ammonia in zero electric field is small, 10 _4 eV. Conse¬ 
quently at room temperature similar numbers of molecules are in these two 
states. The principle of an ammonia maser 2 is to isolate the molecules that 
are in the first excited state, and then to harvest the radiation that is emit¬ 
ted as the molecules decay to the ground state. The isolation is achieved 
by exploiting the fact that, as we now show, when an electric field is ap¬ 
plied, molecules in the ground and first excited states develop polarisations 
of opposite sign. 

We define the dipole-moment operator P by 


P\+) = -qs\+) ; P|-) = + 9 s|->, (5.25) 


so a molecule in the |+) state has dipole moment — qs and a molecule in the 
|—) state has dipole moment -hgs. 3 To measure this dipole moment, we can 
place the molecule in an electric field of magnitude £ parallel to the dipole 
axis. Since the energy of interaction between a dipole P and an electric field 
£ is —P£, the new Hamiltonian is 


( E + q£s _-A \ 
^ —A E — q£s J 

This new Hamiltonian has eigenvalues 


E ± = E ± y A 2 + {q£s ) 2 . 


(5.26) 


(5.27) 


These are plotted as a function of field £ in Figure 5.10. When £ = 0 the 
energy levels are the same as b efore. As £ slowly increases, E increases 
quadratically with £, because \JA 2 + ( q£s) 2 ~ A + (q£s) 2 /2A, but when 
£ A/qs the energy eigenvalues change linearly with £. Notice that in this 
large-field limit, at lowest order the energy levels do not depend on A. 

The physical interpretation of these results is the following. In the 
absence of an electric field, the energy eigenstates are the states of well- 
defined parity |e) and |o), which have no dipole moment. An electric field 
breaks the symmetry between the two potential wells, making it energetically 
favourable for the N atom to occupy the well to which the electric field is 
pushing it. Consequently, the ground state develops a dipole moment P, 
which is proportional to £. Thus at this stage the electric contribution to 
the energy of the ground state, which is —P£, is proportional to £ 2 . Once 


2 ‘maser’ is an acronym for “microwave amplification by stimulated emission of radiation”. 

3 The N atom is negatively charged so the dipole points away from it. 
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this contribution exceeds the separation A between the states of well-defined 
parity, the molecule has shifted to the lower-energy state of the pair |±), 
and it stays in this state as the electric field is increased further. Thus for 
large fields the polarisation of the ground state is independent of E and the 
electric contribution to the energy is simply proportional to E. 

While the ground state develops a dipole moment that lowers its energy, 
the first excited state develops the opposite polarisation, so the electric field 
raises the energy of this state, as shown in Figure 5.10. The response of the 
first excited state is anomalous from a classical perspective. 

Ehrenfest’s theorem (2.57) tells us that the expectation values of oper¬ 
ators obey classical equations of motion. In particular the momentum of a 
molecule obeys 


d ipx) = _ / dV\ 
d t \ dx / ’ 


(5.28) 


where £ is a Cartesian coordinate of the molecule’s centre of mass. The 
potential depends on x only through the electric field E , so 


dV _ d£ 
dx dx ’ 

from which it follows that 

d {Px) = . . d£_ 
d t [ ' dx ' 


(5.29) 


(5.30) 


Since the sign of (P) and therefore the force on a molecule depends on 
whether the molecule is in the ground or first excited state, when a jet of 
ammonia passes through a region of varying E , molecules in the first excited 
state can be separated from those in the ground state. 

Having gathered the molecules that are in the excited state, we lead 
them to a cavity that resonates with the 150 GHz radiation that is emitted 
when molecules drop into the ground state. The operation of an ammonia 
maser by Charles Townes and colleagues 4 was the first demonstration of 
stimulated emission and opened up the revolution in science and technology 
that lasers have have since wrought. 


5.3 Scattering of free particles 

We now consider what happens when a particle that is travelling parallel 
to the x axis encounters a region of sharply changed potential energy. In 
classical physics the outcome depends critically on whether the potential 
rises by more than the kinetic energy of the incoming particle: if it does, the 
particle is certainly reflected, while it continues moving towards positive x in 
the contrary case. We shall find that quantum mechanics predicts that there 
are usually non-vanishing probabilities for both reflection and transmission 
regardless of whether the rise in potential exceeds the initial kinetic energy. 

We assume that each particle has well-defined energy E, so its wave- 
function satisfies the tise (5.2). We take the potential to be (Figure 5.11) 

V={^ f0r J x|<a (5.31) 

0 otherwise, 

where Vq is a constant. At \x\ > a the relevant solutions of (5.2) are 


g±i kx 


or 


J sin (kx + (/>) at x > a 
\ ± sin (—kx + 4>) at x < —a 


with 


k = 


2 mE 


(5.32a) 


4 Gordon, J.P., Zeiger, H.J., & Townes, C.H., 1954, Phys. Rev, 95, 282 (1954) 
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Vo |- 

if e =B sin(-kx+0) cosh(Kx) 

if 0 = — B’ sin(-kx + 0’) A sinh(Kx) 

0 - 

— a a 


B sin(kx+0) 
B’ sin(kx+0’) 


Figure 5.11 A square, classically forbidden barrier and the functional forms for stationary 
states of even (top) and odd parity. 


where 0 is a constant phase. Since the time dependence of these stationary 
states is obtained by introducing the factor e ~ lEt / n ; a plus sign in e ±lkx 
implies that the particle is moving to the right, and a minus sign is associated 
with movement to the left. A wavefunction that is proportional to sin (kx+(f>) 
contains both types of wave with amplitudes of equal magnitude, so it makes 
motion in either direction equally likely. At \x\ < a the relevant solutions of 
(5.2) are 

±iKx f cos (Kx) 1 , „ /2 m(E — V 0 ) , „ T * 

l sm(A'x) J y h 2 

&±Kx Qr |“>sM^)| with K= ^2m(Vo -E) when A < Vo. 

(5.32b) 

In every case we have a choice between exponential solutions and solu¬ 
tions of well-defined parity. Since our physical problem is strongly asymmet¬ 
ric in that particles are fired in from negative x rather than equally from both 
sides, it is tempting to work with the exponential solutions of the tise rather 
than the solutions of well-defined parity. However, the algebra involved in 
solving our problem is much lighter if we use solutions of well-defined parity 
because then the conditions that ensure proper behaviour of the solution 
at x = a automatically ensure that the solution also behaves properly at 
x = —a; if we use exponential solutions, we have to deal with the cases 
x = ±o individually. Therefore we seek solutions of the form 

, , s _ J .Bsin(&:|a;| + 4 >) for |x| > a 

e '“ ' \ cos(Kx) or cosh(ATai) otherwise; 

f B' sin(fca; + cf)') for x > a (5.33) 

tpo{x) = < Asin(ATa;) or Asinh(A"a;) for |:r| < a, 

[ — B' sin(fc|a;| + 4>') otherwise, 


where A, B, B ', <f> and <j)' are constants. B,(j> and (j)' will be unambiguously 
determined by the conditions at x = ±a. These conditions will make B' 
proportional to A, which we treat as a free parameter. 

In our study of the bound states of potential wells in §5.1, the require¬ 
ment that the wavefunction vanish at infinity could be satisfied only for 
discrete values of E. These values of E (and therefore k and K) differed 
between the even- and odd-parity solutions, so all energy eigenfunctions au¬ 
tomatically had well-defined parity. In the case of a free particle, by contrast, 
we will be able to construct both an even-parity and an odd-parity solution 
for every given value of E. Linear combinations of these solutions Ve{x) and 
ip 0 (x) of well-defined parity are energy eigenfunctions that do not have well- 
defined parity. We now show that the sum these solutions of the tise with 
well-defined parity can be made to describe the actual scattering problem. 

In the solution we seek, there are no particles approaching from the 
right. Adding the even- and odd-parity solutions, we obtain at x > a a 
solution of the form 


ip e (x) + ipo(x) = B sin (kx + </>) + B' sin(fcx + <j/) 


pi kx . . p-i kx . . 

— ( Be i0 + B'e 1 * J - + B'e^ J . 


(5.34) 
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The condition that no particles are approaching from the right is 

B' = -BeW-®, (5.35) 


for then at x > a the solution becomes 


pi kx 

ipe(x) + ipo{x) = -^-Be l4 (l - e 2lW _<w ) (x > a), (5.36) 

which includes only particles moving to the right. At x < — a the solution is 
now 


ipe(x) + 4>o(x) = B sin {—kx + <j>) — B' sin {—kx + <f>) 


pi kx , \ p-i kx , \ 

— [-Ber i4 + B'e~ i4 J + — [Be 1 * - B'e i4 J 


= e lkx iBe~ H 


r. — lkX 


2 i 


Be } 4 (l + e : 


2i 


(5.37a) 

In the solution given by equations (5.36) and (5.37a) the incoming amplitude 
is i Be~' 4 , while the amplitudes for reflection and transmission are 


jJ 4 (1 + e 2iA4 ) (reflected) 

—e 1 ^ (l — e 2lA4 ) (transmitted), 


(5.37b) 


where 

Acj) = (j)’ - (j) (5.37c) 

is phase difference A <f> between the odd- and even-parity solutions at \x\ > 
a. From the ratios of the mod-squares of the outgoing amplitudes to that 
of the incoming amplitude i B we have that the reflection and transmission 
probabilities are 


P re fl = COS 2 (A (/)) Ptrans = Sm 2 (A (j)). (5.38) 

Thus A(j) determines the reflection and transmission probabilities. Notice 
that these formulae for the transmission and reflection probabilities have 
been obtained without reference to the form of the wavefunction at |ai| < a. 
Consequently, they are valid for any scattering potential V ( x) that has even 
parity and vanishes outside some finite region, here |a;| < a. 

The scattering cross section In the case that Vo < 0, so the scattering 
potential forms a potential well, the outgoing wave at x > a represents 
two physically distinct possibilities: (i) that the incoming particle failed to 
interact with the potential well and continued on its way undisturbed, and 
(ii) that it was for a while trapped by the well and later broke free towards 
the right rather than the left. We isolate the possibility of scattering by 
writing the amplitude of the outgoing wave as 1 + T times the amplitude 
of the incoming wave. Here the one represents the possibility of passing 
through undisturbed and T represents real forward scattering. From our 
formulae (5.37) for the amplitudes of the incoming and outgoing waves we 
have that 

T = \ (e 2i4 ' - e 2i0 ) - 1. (5.39a) 

If we similarly write the amplitude of the reflected wave as R times the 
amplitude of the incoming wave, then from the formulae above we have 

R = -I f e 2i4 ' + e 2i4 


(5.39b) 
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The total scattering cross section 5 is defined to be the sum of the prob¬ 
abilities for forward and backward scattering: 

o =\R\ 2 + \T\ 2 . (5.40) 

Now |i?| 2 is just the reflection probability -P re fl, and the transmission proba¬ 
bility is 

Ptrans = |1 + T| 2 = 1 + |T| 2 + T + T*, (5.41) 

SO 

a = P refl + P trans - 1 - T - T* = -(T + T*). (5.42) 

From equation (5.39a) we have an expression for the total scattering cross 
section in terms of the phase angles 

cr = 2 — cos(2()/) + cos(2(f>). (5.43a) 

The trigonometric identities 1 + cos 2cf> = 2 cos 2 <f> and 1 — cos 2(f) = 2 sin 2 <f> 
enable us to re-express the cross section as 

cr = 2 (sin 2 (f)' + cos 2 (£) . (5.43b) 


5.3.1 Tunnelling through a potential barrier 

Now consider the case Vq > E in which classical physics predicts that all 
particles are reflected. From equations (5.33), the conditions for both the 
wavefunction and its derivative to be continuous at x = a are 


cosh(ATa) = B sin (ka + <f>) 
K sinh(ATa) = Bk cos (ka + (f>) 

where 



( A sinh(A'a) = B' sin(fca + <f>') 

\ KAcosh(Ka) = B'kcos(ka + </)'), 

(5.44a) 


and K = 


12m(Vo — E) 


(5.44b) 


Dividing the equations of each pair into one another to eliminate the con¬ 
stants A, B and B' , we obtain 


tan(fca + </>) = (k/K) coth(A'a) or tan(/ca + (f)) = {k/K) tanh(A'a). 

(5.45) 

On account of the fact that for any x, tan(a: + 7 r) = tana:, the equations have 
infinitely many solutions for <f> and <t> that differ by r7r, where r is an integer. 
From equations (5.39) and (5.43a) we see that these solutions give identical 
amplitudes for reflection and transmission and the same value of the total 
scattering cross section cr. Hence we need consider only the unique values of 
<f> and (j>' that lie within ±7r/2 of —ka. 

Equations (5.38) show that the transmission and reflection probabilities 
are determined by the phase difference A (f> = <f>' — (f>. From (5.45) we have 

A (f> = arctan tanh(A'a)^ — arctan cotlr(A'a)^ . (5.46) 


Figure 5.12 shows the transmission probability sin 2 (A<^>) as a function of 
the energy of the incident particle for (2 mVoa 2 /h 2 ) 1 ^ 2 = 0.5,1 and 1.5. We 
see that for the most permiable of these barriers the transmission probability 
reaches 50% when the energy is less than a third of the energy, Vo, classically 
required for passage. On the other hand, the transmission probability is still 

5 This definition of the total scattering cross section only applies to one-dimensional 
scattering problems. See §12.3 for the definition of the total scattering cross section that 
is appropriate for realistic three-dimensional experiments. 
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Figure 5.12 The transmission probability for a particle incident on a potential-energy 
barrier of height Vo and width 2a as a function of the particle’s energy. The curves are 
labelled by the values of the dimensionless parameter (2 mVoa?/h 2 ) 1 / 2 . 

only 80% when E = \ r 0 and classically the particle would be certain to pass. 
A barrier of the same height but three times as thick allows the particle to 
pass with only 2% probability when E = Vo/3, and even when E = Vq the 
chance of passing this thicker barrier is only a third. 

When the barrier is high, Ka 1 so both t = tanh(ATa) and coth(A'a) = 
l/t are close to unity: 


t = tanh (Ka) = + *~ Ka - (1 - e" 2 ^) 2 - (5.47) 

Consequently, the arguments of the two arctan functions in equation (5.46) 
are similar and we can obtain an approximate expression for A <f> by writing 


arctan 



= arctan 


~ arctan 



Kt 


C t 2 



( k \ 1 k u 2 

\Kt) + 1 + {k/Kt) 2 Kt { ' t 


1 ), 


(5.48) 


where we have used the standard formula darctanx/da; = 1/(1 + x 2 ). Using 
equations (5.47) and (5.48) in equation (5.46), and we have 


A</) 


4e~ 2Ka 
Kt/k + k/Kt 


4 k 

~K 


e~ 2Ka 


(Ka » 1). 


(5.49) 


Thus the probability of passing the barrier, sin 2 (A^>), decreases like e 4Ka 
as the barrier gets higher. 


5.3.2 Scattering by a classically allowed region 

Now consider the case of scattering by the square potential (5.31) when 
E > Vo, so the region of non-zero potential is classically allowed. Physically 
that region could be a classically surmountable barrier (Vo > 0) or a potential 
well (Vo < 0). At |a?| < a the wavefunctions of well-defined parity are now 
either cos(Kx) or Asin(A"a) and from equation (5.33) the conditions for 
continuity of the wavefunction and its derivative at x = a are 

cos(Ka) = B sin (ka + 0) ) f Asm(Ka) = B' sin(fca + <f>') 

—AT sin(ATa) = Bk cos (ka + </>) J ( KA cos(Ka) = B'k cos(fca + </'), 

(5.50a) 
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Figure 5.13 The probability of reflection by three potential barriers of height Vo and 
three half-widths a as functions of E/Vq. The curves are labelled by the dimensionless 
parameter (2m Vo a 2 / Ti 1 ) 1 / 2 . 


where 


k = 


2 mE 


and K = 


12m(E — Vq) 


(5.50b) 


By dividing the second equation of each pair into the first we obtain equations 
that uniquely determine the two solutions: 


tan(ka+cj)) = —(k/K) cot(Ka) or tan(fca+<//) = (k/K) tan(/\a). (5.51) 


The points in Figure 5.13 at E > Vq were obtained by solving these equations 6 
for (j> and <fJ and then calculating the reflection probability cos 2 (</>' — </>), while 
the remaining points were obtained from equations (5.46) for E < Vo. We 
see that for all three barrier widths the reflection probability obtained for 
E > V o joins smoothly onto that for E < Vq . The reflection probability 
tends to zero with increasing E/Vo as we would expect, but its dependence 
on the thickness of the barrier is surprising: for E/Vo = 2 the thickest 
barrier has the lowest reflection probability. In fact, the reflection probability 
vanishes for E slightly larger than 2Vo and then increases at larger energies. 
Similarly, the probability for transmission through the next thickest barrier 
vanishes near E = 3.5Vo- The cause of this unexpected phenomenon is 
quantum interference between the amplitudes to be reflected from the front 
and back edges of the barrier, which cancel each other when the barrier is of 
a particular thickness. 

When the constant Vo in the potential (5.31) is negative, there is a po¬ 
tential well around the origin rather than a barrier. In the classical regime 
the probability of reflection is zero, but as Figure 5.14 shows, it is in general 
non-zero and is large near £l/|Vb| <C 1. The oscillations in the reflection prob¬ 
ability apparent in Figure 5.14 are caused by quantum interference between 
reflections from the two edges of the well. 


5.3.3 Resonant scattering 

In the limit that a barrier becomes very high, the probability that it reflects 
an incoming particle tends to unity. Consequently, a particle that encounters 
two high barriers (Figure 5.15) can bounce from one barrier to the other a 
great many times before eventually tunnelling through one of the barriers and 

6 An explicit expression (5.78) for the reflection probability in terms of ka and Ka and 
without reference to (f> or cf>' can be derived (Problem 5.10). This formula is useful when 
limiting cases need to be examined. 
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E/|Vo| 


Figure 5.14 The probability for reflection by square potential wells of depth |Vo|, The 
full curve is for (2nt\V\o‘ 2 /fr) 1 / 2 = 3 and the dashed curve is for a well only half as wide. 



Figure 5.15 Schematic of the potential-energy function V(x) experienced by an o-particle 
near an atomic nucleus. The short-range ‘strong’ force causes the particle’s potential 
energy to rise extremely steeply at the edge of the nucleus. The long-range electrostatic 
repulsion between the nucleus and the alpha particle causes V ( x ) to drop steadily as the 
o-particle moves away from the nucleus. 


Tp e =B sin( —kx+0) 

cos(kx) 

B sin(kx + 0) 

B’ sin( —kx+0’) 

A sin(kx) 

B’ sin(kx+0’) 


— a a 


Figure 5.16 A pair of (5-function potentials form a well within which a particle can be 
trapped. The forms taken by the wavefunctions of the stationary states of even (top) and 
odd parity are shown. 


escaping to infinity. This situation arises in atomic nuclei because the short- 
range ‘strong’ force confines charged particles such as protons and helium 
nuclei (a-particles) within the nucleus even though it would be energetically 
advantageous for them to escape to infinity: the electrostatic energy released 
as the positively charged particle recedes from the positively charged nucleus 
can more than compensate for the work done on the strong force in moving 
beyond its short effective range (Figure 5.15). Some types of radioactivity - 
the sudden release of a charged particle by a nucleus - are caused by these 
particles tunnelling out of a well that has confined them for up to several 
gigayears. We now use a toy model of this physics to demonstrate that there 
is an important link between the cross section for scattering by a well and the 
existence of long-lived bound states within the well. This connection makes 
it possible to probe the internal structure of atomic nuclei and ‘elementary’ 
particles with scattering experiments. 

We model the barriers that form the potential well by (5-function poten¬ 
tials, located at x = ±a: 


V ( x ) = Vs {<5(a; + a) + 6 (x — a)} with Vs > 0. 


(5.52) 
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Figure 5.17 The total scattering cross sections of double 5-function barriers as a function 
of the wavenumber of the incoming particle. The barriers are located at x = d=a. The full 
curve is for high barriers ( 2mV§a/h 2 = 40) while the dotted curve is for lower barriers 
(2 mV^a/ti 2 = 10). 

By integrating the TISE 


(5 - 53) 

for an infinitesimal distance across the location of the 5-function barriers 
in equation (5.52), we find that a barrier introduces a discontinuity in the 
gradient of the wavefunction of magnitude (cf. eq. 5.13) 


dip 

da’ 


Kip, 


where 



(5.54) 


Hence the energy eigenstates that will enable us to calculate scattering by a 
double-5-function system take the form of sinusoids at |x| < a and at \x\ > a 
that join continuously at x = ±a in such a way that their gradients there 
differ in accordance with equation (5.54) (Figure 5.16). 

At x = a the requirements on the even-parity solution ip e (x) that it be 
continuous and have the prescribed change in derivative, read 


B sin(fca + </>) = cos (ka) 
kBcos{ka + (p) = — ksm(ka) +Kcos(ka). 

Similarly the conditions on the odd-parity solution ip 0 (x) are 

B' sin(fca + <p') = Asin(fca) 
kB' cos (ka + <p') = kAcos(ka) + KAs'm(ka). 


(5.55a) 


(5.55b) 


Dividing one equation in each pair by the other to eliminate A, B and B' we 
obtain 

cot(fca + </>) = K/k — tan(fca) 

cot(fca + tp') = K/k + cot(fca). 

From these expressions and equations (5.38) we can easily recover the prob¬ 
ability sin 2 (</> / — (p ) that an incoming particle gets past both 5-function bar¬ 
riers. More interesting is the total scattering cross section, which is related 
to the phases by equation (5.43b). Figure 5.17 shows as a function of the 
wavenumber of the incoming particle the cross sections for barriers of two 
heights. The height of a barrier is best quantified by the dimensionless num¬ 
ber 2 mVsa/h 2 = Ka. The full curve in Figure 5.17 is for the case that 
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I\a = 40 and the dotted curve is for the case of a lower barrier such that 
Ka = 10. In each case the cross section shows a series of peaks. In the case 
of the higher barrier, these peaks lie near ka = mr/2, with n = 1,2,.... In 
the case of the lower barrier the peaks are less sharp and occur at slightly 
smaller values of ka. 

If the barriers were so high as to be impenetrable, the particle would have 
bound states with ka = mr/2, which is the condition for the wavefunction to 
vanish at x = ±a. Each peak in the scattering cross section is associated with 
one of these bound states. Physically, the scattering cross section is large 
near the energy of a bound state because at such an energy the particle can 
become temporarily trapped between the barriers, and after a delay escape 
either to the right or the left. 

When the barriers have only finite height, the state |trap) in which 
the particle is initially trapped in the well is not a stationary state, and 
its expansion in stationary states will involve states whose energies span a 
non-zero range, say (Eg — T/2, Eq + T/2). For simplicity we assume that 
|trap) has even parity, so it can be expressed as a linear combination of the 
even-parity stationary states |e; E): 

pEq-\-T/2 

|trap) = / dE a(E)\e-, E), (5.57) 

Je 0 - r/2 


where a(E) is the amplitude to measure energy E. Outside the well the 
wavefunction of this state is 

rEo+r /2 

V ? trap(*^) oc / dE a(E) sin(fca; + <p) (x > a). (5.58) 

J E 0 -T/2 

Below we shall find that when the well is very deep, <p becomes a sensitive 
function of E in the neighbourhood of particular ‘resonant’ energies. Then 
the sines in equation (5.58) cancel essentially perfectly on account of the 
rapidly changing phase <p(E). When the integral is small, there is negligible 
probability of finding the particle outside the well. 

The evolution of ip tra p with time is obtained by adding the usual factors 
e _ i Et / h in the integral of equation (5.58): 

pEo+r /2 

t/^trap (x, t) oc / dEa(E) sin(kx + <p) e lEt ^ h (x > a). (5.59) 

JE 0 -r/2 

After a time of order h/T the relative phases of the integrand at Eq—T/2 and 
Eq + r/2 will have changed by 7r and the originally perfect cancellation of 
the sines will have been sabotaged. The growth of the value of the integral, 
and therefore the wavefunction outside the well, signals increasing probability 
that the particle has escaped from the well. The more rapidly <p changes with 
E, the smaller is the value of T at which a negligible value of the integral in 
equation (5.58) can be achieved, and the smaller T is, the longer the particle 
is trapped. Thus sensitive dependence of the phases on energy is associated 
with long-lived trapped states, which are in turn associated with abnormally 
large scattering cross sections. Notice that in Figure 5.17 the peaks are 
narrower at small values of k because the smaller the particle’s energy is, the 
smaller is its probability of tunnelling through one of the barriers. 

The Breit—Wigner cross section We have seen that when particles are 
scattered by a model potential that contains a well, the total scattering cross 
section has narrow peaks. The physical arguments given above suggest that 
this behaviour is generic in the sense that it is related to the time it takes 
a particle to tunnel out of the well after being placed there. What we have 
yet to do is to understand mathematically how the fairly simple formulae 
(5.43b) and (5.56) generate sharp peaks in the energy dependence of a. An 
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Figure 5.18 The values of the phases <fi (full curve) and cf>' (dashed curve) from equations 
(5.56) as functions of the wavenumber of the incoming particle when the latter is scattered 
by the double 5-function well (eq. 5.52). These results are for the case 2 mVga/h 2 = 40. 


understanding of this phenomenon will motivate a simple analytic model of 
resonant scattering that is widely used in experimental physics. 

Figure 5.18 shows the values of the phases </> and <j)' that solve equation 
(5.56). For most values of k (and therefore E), the two angles are equal, so 
the sum sin 2 <t> + cos 2 <j> in equation (5.43b) is unity. The peaks in a occur 
where </> and qS' briefly diverge from one another at the integral-sign features 
in Figure 5.18, which we shall refer to as ‘glitches’. 

We are interested in the case K/k 1. Then for most values of ka 
the right sides of equations (5.56) are dominated by the first term, so the 
cotangent on the left is equal to some large positive value, and its argument 
lies close to zero. However each time ka/it approaches (2r + 1)7 t/ 2 with r 
an integer, the tangent in the first equation briefly overwhelms K/k and the 
right side changes from a large positive number to a large negative number. 
Consequently the argument of the cotangent on the left quickly increases to 
a value close to n. As ka increases through (2r + 1)7 t/ 2, the tangent instan¬ 
taneously changes sign, and the argument of the cotangent instantaneously 
returns to a small value. Examination of the third glitch in Figure 5.18 con¬ 
firms that </> rises rapidly but continuously by almost 7r and then suddenly 
drops by exactly 7r as this analysis implies. The abrupt rise in (/> is centred 
on the point at which ka + (f> = 7t/2, at which point <f> — —m because at a 
glitch ka ~ (2 r + 1)7t/ 2. Consequently, glitches in <fi are centred on points 
at which cos= 1. Meanwhile <j>' ~ — (2r + l)7r/2, so sin 2 (<j>') = 1 and 
equation (5.43b) gives a — 4. A very similar analysis reveals how the second 
of equations (5.56) generates glitches in </>'. 

Putting this argument on a quantitative basis, we Taylor expand tan(fca) 
around the resonant value of k, /cr, at which tan(fcfja) = K/kn . Then 

K/k — tan (ka) ~ —sec 2 (kna)a5k = — (1 + K 2 /k^)aSk, (5.60) 

where 5k = k — /cr. We also observe that glitches in <j) occur where ka — 
(2 r + l)7r/2, so 


,, cos(ka) cos(d>) — sin(fca) smch 

cot {ka + <f> )= . ^- ,, , . ~ - tan 0. 

sm(fca) cos cp + cos (ka) sm(0) 


With these approximations the first of equations (5.56) reads 


(5.61) 


tan(/> ~ (l + K 2 /k^/j aSk. 
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Figure 5.19 The total cross section for the scattering of neutrons by 238 U nuclei. (From 
data published in L.M. Bollinger, et al., Phys. Rev., 105, 661 (1957)) 


Equation (5.43b) now gives the cross section as 

<7 = 2 (sin 2 6' + cos 2 <f>) = 2 ( sin 2 6' H-=— j 

v ' V 1 +tan 2 <£ J 


~ 2 I sin 2 6' + 


(5.62) 


l + (l+A' 2 /fc|) 2 a 2 (<5/c) 2 , 

Thus in the vicinity of the resonant energy E R = h 2 k^/2m,, where 


Sk = -^—(E - E r ), 
n fc R 


(5.63) 


the cross section has the form 


a = constant 


2(T/2) 2 

(T/2) 2 + (E- E r ) 2 ’ 


(5.64a) 


where 


2 h 2 k R ^ 2?i 2 fcf, 
(1 + K 2 /k^)am I\ 2 am 


(5.64b) 


T has the dimensions of energy and is the characteristic width of the reso¬ 
nance. Experimental data for the energy dependence of cross sections are 
often fitted to the functional form defined by equation (5.64a), which is 
known as the Breit—Wigner cross section. Figure 5.19 shows a typical 
example. 

The dependence on energy of the phase (j) and the total scattering cross 
section a in the vicinity of a peak in cr is reminiscent of the behaviour near 
a resonance of a lightly damped harmonic oscillator (Box 5.1). 

By the uncertainty principle, the width T of the Breit-Wigner cross 
section (5.64a) corresponds to a time scale 


h K 2 am 

<R = r = 2hkl ' 


(5.65) 


A naive calculation confirms that t R is the timescale on which a particle 
escapes from the well. When a particle encounters a (5-function barrier, it is 
easy to show (Problem 5.11) that its probability of tunnelling through the 
barrier is 


-Ptun — 


4 + (K/k R y 


4(fc R /A') 2 for K > k R . 


(5.66) 
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Box 5.1: Analogy with a damped oscillator 


When a weakly damped harmonic oscillator is driven at some angular 
frequency ui, the phase of the steady-state response changes sharply in 
the vicinity of the oscillator’s resonant frequency wr. Specifically, if the 
oscillator’s equation of motion is 

x + jx + WrZ = F cos (uit), 

then the steady-state solution is x = X cos(ujt — </>), where 


X = 


F 

V(t4 - w 2 ) 2 + 


and 


<j> = arctan 



As the driving frequency approaches the resonant frequency from below, 
the phase lag <j> increases from near zero to 7r/2. As the driving frequency 
passes through resonance, <j> drops discontinuously to —7 t/ 2, and then in¬ 
creases to near zero as oj 2 — becomes large compared to ury. These 
results suggest a picture in which a quantum well is an oscillator that 
is being driven by the incoming probability amplitude. The oscillator’s 
level of damping is set by the well’s characteristic energy T of equa¬ 
tion (5.64b), and the form of the Breit-Wigner cross section of equation 
(5.64a) mirrors the Lorentzian form of the oscillator’s amplitude X. 


Hence the probability of remaining in the well after bouncing n times off the 
walls is 

Htrap = (1 - Ptun) 11 - (5.67) 

The particle moves from one barrier to the other in a time tf = 2am./Tikn 
and in this time the logarithm of Pt rap changes by ln(l — P tun ) — — Ptun, so 

Aln(P trap ).-^ = -l, (5.68) 

where £r is given by equation (5.65). Thus this simple physical argument 
confirms that h/T is the characteristic time for the particle to remain in the 
well. 


5.4 How applicable are our results? 

It seems unlikely that any real system has a discontinuous potential V(x), so 
our results are of practical interest only if sufficiently steep changes in V can 
be treated as discontinuous. We now investigate how abrupt a change in po¬ 
tential must be for results obtained under the assumption of a discontinuous 
potential to be applicable. 

When the wavefunction is evanescent (i.e. oc e ±Kx ) on one side of the 
discontinuity, our results carry over to potentials that change continuously: 
where E < V(x), the wavefunction is no longer a simple exponential but 
its phase remains constant and its amplitude decreases monotonically, while 
in the region E > V(x) the sinusoidal dependence ip(x) oc e lkx is replaced 
by some other oscillatory function of similar amplitude. Qualitatively the 
results we have obtained for particles confined by a step potential carry over 
to continuously varying potentials when E < V on one side of a region of 
varying potential. 

The situation is less clear when E > V (x) on both sides of the change 
in the potential. The relevance to such cases of our solutions for step po¬ 
tentials can be investigated by solving the tise numerically for a potential 
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Figure 5.20 The full curve shows the probability of reflection when a particle moves from 
x = — oo in the potential (5.69) with energy E = h 2 k 2 /2m and Vo = 0.7 E. The dotted 
line is the value obtained for a step change in the potential (Problem 5.4). 



Figure 5.21 Each curve shows the reflection probability when a particle with kinetic 
energy E encounters a region in which the potential V(x) smoothly changes to Vo over 
a distance 26, and then smoothly returns to zero; the change in V is given by equation 
(5.69) with x replaced by x-\-a, and the fall is given by the same equation with x replaced 
by —{x — a). The full curves are for ka = 30 and the dashed curves for ka = 15. The 
left panel is for barriers of height Vo = 0.7 E, while the right panel is for potential wells 
(Vo = —0.7 E). 


that changes over a distance that can be varied (Problem 5.15). Consider 
for example 


{ 0 for x < —b 

|[1 + sin(7ra;/26)] for |x| < b (5.69) 

1 for x > b, 

which changes from 0 to Vo over a distance 2b centred on the origin. Fig¬ 
ure 5.20 shows the probability of reflection when a particle with energy 
nk 2 /2m = Vo/0.7 encounters this rise in potential energy as it approaches 
from x = — oo. For kb <C 1, the probability of reflection is close to that ob¬ 
tained for the corresponding step potential (Problem 5.4), but it falls to very 
much smaller values for kb > 2. 7 Thus treating a rapid change in potential 
energy as a discontinuity can lead to a serious over-estimate of the reflected 
amplitude. 

Figure 5.21 shows reflection probabilities for particles of energy E that 
encounter a finite region of elevated or depressed potential energy as a func¬ 
tion of the sharpness of the region’s sides - the changes in potential energy 
occur in a distance 2b as described by equation (5.69). The left panel is for 
potential barriers of height 0.7 E and the right panel is for potential wells of 

7 The ‘WKBJ’ approximation derived in §11.6 provides an analytic approximation to 
the solution of the TISE when kb is significantly larger than 2n. 
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depth 0.7 E. The full curves are for the case in which the region’s half-width 
a satisfies ka = 30, where k is the wavenumber of the incoming and outgoing 
wavefunctions, while the dashed curves are for regions only half as wide. The 
left panel shows that when kb = 1, the reflection probability generated by a 
smooth barrier is smaller than that for a sharp step by a factor ~ 1.7, and 
when kb = 2 it is nearly a factor ten smaller than in the case of an abrupt 
barrier. The right panel shows that in the case of a potential well, even 
smaller values of kb are required for the assumption of an abrupt change in 
V(x) to be useful. Thus these results confirm the implication of Figure 5.20 
that modelling a change in potential energy by a sharp step is seriously mis¬ 
leading unless the half width of the transition region b satisfies the condition 
kb < 1. 

The amplitude to be reflected by a region of varying potential energy 
decreases rapidly with increasing half width b of the transition region, but 
the amplitudes to be reflected at the leading and trailing edges of a region 
of varying potential remain comparable. Consequently, destructive interfer¬ 
ence between these amplitudes is possible for all values of b. Moreover, the 
phases of the reflected amplitudes depend on b, so the plots of overall reflec¬ 
tion probability versus b in Figure 5.21 show there are regular nulls in the 
probability for reflection like those we see in Figure 5.14 for the probability 
for reflection by an abrupt potential well. 

We conclude that many qualitative features of results obtained with step 
potentials also hold for continuous potentials, but results obtained for step 
potentials with no classically forbidden region are quantitatively misleading 
when applied to continuous potentials unless the distance over which the 
potential changes is small compared to the de Broglie wavelength A = h/p 
of the incident particles. 

Let’s consider under what circumstances this condition could be satisfied 
for a stream of electrons. The de Broglie wavelength of electrons with kinetic 
energy E is 


A = 1.16 



(5.70) 


For the one-dimensional approximation to apply, we need the beam to be 
many A wide, and for the step approximation to be valid, we require the 
change in potential to be complete well inside A. In practice these conditions 
can be simultaneously satisfied only when the potential change is associated 
with a change in the medium through which the electrons are propagating. 
If the medium is made of atoms, the change must extend over at least the 
characteristic size of atoms 0.1 nm. Hence we require E < 1 eV. 

Realistically step potentials are relevant only for less massive particles, 
photons and neutrinos. The propensity for some photons to be transmit¬ 
ted and some reflected at an abrupt change in potential, such as that at a 
glass/air interface, plays an important role in optics. By contrast, electrons, 
neutrons and protons are unlikely to be partially transmitted and partially 
reflected by a region of varying potential. 

These considerations explain why the phenomenon of partial reflection 
and partial transmission is unknown to classical mechanics, which is con¬ 
cerned with massive bodies that have de Broglie wavelengths many orders of 
magnitude smaller than an atom at any experimentally accessible energy. 
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5.5 Summary 

In this chapter we have examined some highly idealised systems and reached 
some surprising conclusions. 

• Any one-dimensional potential well has at least one bound state, and 
may have more depending on the size of the value of the dimensionless 
parameter W defined by equation (5.8), with Vo and a interpreted as 
the well’s characteristic depth and width, respectively. 

• A particle trapped by a very narrow or shallow well has negligible prob¬ 
ability of being found in the well. 

• When solving the tise in the presence of an infinite step in the potential, 
we should require the wavefunction to vanish at the base of the step. 

• When two identical square potential wells are separated by a barrier, 
the eigenenergies occur in pairs, and the associated wavefunctions have 
either even or odd parity with respect to an origin that is symmetrically 
placed between the wells. The even-parity state of a pair lies slightly 
lower in energy than the odd-parity state. A sum of the lowest two 
eigenstates is a state in which the particle is certainly in one well, while 
the difference gives a state in which the particle is certainly in the other 
well. A particle that starts in one well oscillates between the wells with a 
period inversely proportional to the difference between the eigenenergies. 
The particle is said to ‘tunnel’ through the barrier that divides the two 
wells at a rate that decreases exponentially with the product of the 
barrier’s height and the square of its width. 

• In an ammonia molecule the nitrogen atom moves in an effective poten¬ 
tial that provides two identical wells and the above model explains how 
an ammonia maser works. 

• A free particle has a non-zero probability to cross a potential barrier 
that would be impenetrable according to classical physics. On the other 
hand, if the potential changes significantly within one de Broglie wave¬ 
length, a particle generally has a non-zero probability of being reflected 
by a low barrier that classical physics predicts will be crossed. 

• The probabilities for a free particle to be reflected or transmitted by a 
potential barrier or a well with very steep sides oscillate as functions of 
the particle’s energy on account of quantum interference between the 
amplitudes to be reflected at the front and back edges of the barrier or 
well. 

• When a free particle is scattered by a region that contains a potential 
well, the total scattering cross section peaks in the vicinity of the energies 
of the well’s approximately bound states. Longer-lived bound states are 
associated with sharper peaks in a plot of scattering cross section versus 
energy because the width in energy of a peak, T, and the lifetime to of 
the corresponding bound state are related by the uncertainty relation 
t 0 T ~ h. 

• The Breit-Wigner formula (5.64a) gives the energy-dependence of a scat¬ 
tering cross-section near a resonance, and the timescale h/T that appears 
in it is the typical time for which a particle is trapped. 

• The results we have obtained for discontinuous potentials V{x) are an 
accurate guide to what will happen when the potential changes contin¬ 
uously when either (i) the particle is classically forbidden on one side 
of the change, or (ii) the change is complete within a fraction of a de 
Broglie wavelength. When the potential changes more gradually, the 
amplitude to be reflected by the region of changing potential is typically 
much smaller than in our idealised examples, but, as in these examples, 
the amplitude to be reflected oscillates as a function of energy. 
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Problems 

5.1 A particle is confined by the potential well 

V(x) = { 0 f ” 1*1 < a (5.71) 

1 oo otherwise. 

Explain (a) why we can assume that there is a complete set of stationary 
states with well-defined parity and (b) why to find the stationary states we 
solve the tise subject to the boundary condition if>(±a) = 0. 

Determine the particle’s energy spectrum and give the wavefunctions of 
the first two stationary states. 

5.2 At t = 0 the particle of Problem 5.1 has the wavefunction 

ib(x) = { for 1*1 < a (5.72) 

10 otherwise. 

Find the probabilities that a measurement of its energy will yield: (a) 
9h 2 ir 2 / (8ma 2 ); (b) 16?i 2 7r 2 /(8TOa 2 ). 

5.3 Find the probability distribution of measuring momentum p for the 
particle described in Problem 5.2. Sketch and comment on your distribution. 
Hint: express (p\x) in the position representation. 

5.4 Particles move in the potential 


V(x) 


0 for x < 0 
Vo for x > 0. 


(5.73) 


Particles of mass m and energy E > Vq are incident from x = — oo. Show 
that the probability that a particle is reflected is 


fk-K 
\k + K 


2 


(5.74) 


where k = \J 2 mE/Ti and K = ^J2m(E — Vo)/h. Show directly from the 
TISE that the probability of transmission is 


4kK 
(k + K) 2 


(5.75) 


and check that the flux of particles moving away from the origin is equal to 
the incident particle flux. 

5.5 Show that the energies of bound, odd-parity stationary states of the 
square potential well 


V(x) 


0 for |x| < a 

Vq > 0 otherwise, 


(5.76) 


are governed by 


cot (ka) = —4 


1 W 2 
(ka) 2 


— 1 where W = 


2m Vq a 2 


and 


Show that for a bound odd-parity state to exist, we require 


k 2 = 2mE/h 2 . 

(5.77) 

W > tt/2. 
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Figure 5.22 The real part of the wavefunction when a free particle of energy E is scattered 
by a classically forbidden square barrier barrier (top) and a potential well (bottom). The 
upper panel is for a barrier of height Vo = E/ 0.7 and half width a such that 2 mEa 2 /Ti 2 = 1 . 
The lower panel is for a well of depth Vo = E/0.2 and half width a such that 2mEa 2 /h 2 = 
9. In both panels (2 mE/h 2 ) 1 / 2 = 40. 



Figure 5.23 A triangle for Prob¬ 
lem 5.10 


5.6 Show that the correctly normalised wavefunction of a particle trapped 
by the potential V(x) = — V$8{x) is ip(x) = y[Ke~ K ^, where I< = mVs/h 2 . 
Show that although this wavefunction makes it certain that a measurement 
of x will find the particle outside the well where its kinetic energy is nega¬ 
tive, the expectation value of its kinetic energy (Ex) = \mV 2 /U 2 is in fact 
positive. Reconcile this apparent paradox as follows: (i) show that for a 
narrow, deep potential well of depth Vg and half-width a, with 2Vda = Vs, 
kaesW = (2mVga 2 /Ti 2 ) 1 / 2 , while Ka ~ W 2 . (ii) Hence show that the con¬ 
tribution from inside the well to (Ex) is \ip(b)\ 2 Vs regardless of the value of 
a. Explain physically what is happening as we send a —> 0. 

5.7 Reproduce the plots shown in Figure 5.22 of the wavefunctions of par¬ 
ticles that are scattered by a square barrier and a square potential well. Give 
physical interpretations of as many features of the plots as you can. 

5.8 Give an example of a potential in which there is a complete set of 
bound stationary states of well-defined parity, and an alternative complete 
set of bound stationary states that are not eigenkets of the parity operator. 
Hint: modify the potential discussed apropos NH 3 . 

5.9 A free particle of energy E approaches a square, one-dinrensional po¬ 
tential well of depth Vo and width 2a. Show that the probability of being 
reflected by the well vanishes when Ka = mr/2, where n is an integer and 
K = (2 m(E + Vg)/Ti 2 ) 1 / 2 . Explain this phenomenon in physical terms. 

5.10 Show that the phase shifts 4> (for the even-parity stationary state) 
and (j)' (for the odd-parity state) that are associated with scattering by a 
classically allowed region of potential Vg and width 2a, satisfy 


tan(fca + 4>) = — (k/K) cot(ifa) and tan(fca + (j)') = (k/K) tan(ATa), 
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where k and K are, respectively, the wavenumbers at infinity and in the 
scattering potential. Show that 


P re fl = cos 2 (0' - 4>) 


(. K/k — k/K) 2 sin 2 (2iv a) 

( K/k + k/K ) 2 sin 2 (2 Ka) + 4 cos 2 (2A"a) 


(5.78) 


Hint: apply the cosine rule for an angle in a triangle in terms of the lengths 
of the triangle’s sides to the top triangle in Figure 5.23. 

5.11 A particle of energy E approaches from i<0a barrier in which the 
potential energy is V(x) = VsS(x). Show that the probability of its passing 
the barrier is 


Ptnn — 


1 + (K/2k) 2 


where k = 


2 mE 


2 mVs 

n 2 


(5.79) 


5.12 An electron moves along an infinite chain of potential wells. For 
sufficiently low energies we can assume that the set {|n}} is complete, where 
\n) is the state of definitely being in the n th well. By analogy with our 
analysis of the NH 3 molecule we assume that for all n the only non-vanishing 
matrix elements of the Hamiltonian are £ = (n\H\n) and A = (n ± l\H\n). 
Give physical interpretations of the numbers A and £. 

Explain why we can write 

OO 

H= ^2 £|n)(n| + A (\n)(n + 1| + \n + l)(n|). (5.80) 

n——oo 


Writing an energy eigenket | E) = ^) J1 a rl \n) show that 

a m (E - £) - A(a m+1 + a m _i) = 0. (5.81) 

Obtain solutions of these equations in which a m oc e lkm and thus find the 
corresponding energies E *.. Why is there an upper limit on the values of k 
that need be considered? 

Initially the electron is in the state 

W = ^2 » ( 5 - 82 ) 

where 0 < k <C 1 and 0 < A <C k. Describe the electron’s subsequent motion 
in as much detail as you can. 

5.13* In this problem you investigate the interaction of ammonia molecules 
with electromagnetic waves in an ammonia maser. Let |+) be the state in 
which the N atom lies above the plane of the H atoms and |—) be the state in 
which the N lies below the plane. Then when there is an oscillating electric 
field £ cos u>t directed perpendicular to the plane of the hydrogen atoms, the 
Hamiltonian in the |±) basis becomes 


H = 


( E + q£s cos ut 

\ -A 


_ ~ A V 

E — q£s cos ut J 


(5.83) 


Transform this Hamiltonian from the |±) basis to the basis provided by the 
states of well-defined parity |e) and |o) (where |e) = (|+) + |—))/ v / 2, etc). 
Writing 

|V>) = a e (t)e- iE ^ R |e) + flo (t)e- iB ^|o), (5.84) 

show that the equations of motion of the expansion coefficients are 


+ e - i(w+aJo)t J 

^ = -i fiae(i) + e - i(w_w o)‘), 


(5.85) 
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where fl = q£s/2h and u >o = ( E a — E e )/Ti. Explain why in the case of a 
maser the exponentials involving w + wq a can be neglected so the equations 
of motion become 

^ - -ifla o (t)e i( “-“ 0)t ; ^ = -i £la e {t)e~^- Uo)t . (5.86) 

Solve the equations by multiplying the first equation by an( j differ¬ 

entiating the result. Explain how the solution describes the decay of a popu¬ 
lation of molecules that are initially all in the higher energy level. Compare 
your solution to the result of setting cj = oj 0 in (5.86). 

5.14 238 U decays by a emission with a mean lifetime of 6.4 Gyr. Take the 
nucleus to have a diameter ~ 10~ 14 m and suppose that the a particle has 
been bouncing around within it at speed ~ c/3. Modelling the potential 
barrier that confines the a particle to be a square one of height Vo and width 
2a, give an order-of-magnitude estimate of W = (2mVoa 2 /h 2 ) 1 / 2 . Given 
that the energy released by the decay is ~ 4MeV and the atomic number 
of uranium is Z = 92, estimate the width of the barrier through which the 
a particle has to tunnel. Hence give a very rough estimate of the barrier’s 
typical height. Outline numerical work that would lead to an improved 
estimate of the structure of the barrier. 

5.15* Particles of mass ?n and momentum Tik at x < - 
potential 

{ 0 for x < —a 

^[1 + sin(7rx/2a)] for |x| < a 
1 for x > a, 

where Vo < h 2 k 2 /2m. Numerically reproduce the reflection probabilities 
plotted Figure 5.20 as follows. Let ipi = ip(xj) be the value of the wavefunc- 
tion at Xj = j A, where A is a small increment in the x coordinate. From 
the tise show that 


—a move in the 

(5.87) 


tpj ~ (2 - A 2 k 2 )ijjj + i - ijj j+ 2 , (5.88) 


where k = \j2m[E — V)/Ti. Determine ipj at the two grid points with the 
largest values of x from a suitable boundary condition, and use the recurrence 
relation (5.88) to determine ipj at all other grid points. By matching the 
values of i\) at the points with the smallest values of a; to a sum of sinusoidal 
waves, determine the probabilities required for the figure. Be sure to check 
the accuracy of your code when Vo = 0, and in the general case explicitly 
check that your results are consistent with equal fluxes of particles towards 
and away from the origin. 

Equation (11.40) gives an analytical approximation for ^ in the case 
that there is negligible reflection. Compute this approximate form of ip and 
compare it with your numerical results for larger values of a. 

5.16* In this problem we obtain an analytic estimate of the energy differ¬ 
ence between the even- and odd-parity states of a double square well. Show 
that for large 9, coth6* — tanh0 ~ 4e -2e . Next letting 6k be the difference 
between the k values that solve 


tan [r7r 


where 


k{b — a)] 



coth (yiE 2 — (ka) 2 ^j 
tanh /W 2 — (fca) 2 ^j 


even parity 

odd parity, 

(5.89a) 


W = 


2mV 0 a 2 

n 2 


(5.89b) 
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5.17 We consider the scattering of free particles of mass m that move in 
one-dimension in the potential V(x) = —WS(x), with W > 0. (a) For a well 
of finite depth Vo and width 2 a the condition on the phases (p and ft of the 
even- and odd-parity wavefunctions ip oc sin(A:a; + ft), etc, for free particles 
are 

k k 

tan (ka + ft) = — — cot(Ka) ; tan(fca + ft) = — — tan(ifa) 

1 \ 1 \ 


Show that in the limit a —> 0, Vq = W/2a ->oowe have tan </> —> —Tftk/mW 
and ft —> 0. Hence obtain the scattering cross section by the ^-function 


potential 

2 

a 1 + (hk/mW) 2 ' 


(5.92) 


(b) Re-derive the equation above for <fi by requiring that ip = sin(fc|x| + ft) 
satisfy the tise. Convince yourself that ip = sin(fca:) is also consistent with 
the tise. 

(a) The wavenumber k is constant as we send a —> 0 so ka —> 0. 
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Composite systems 


Systems often consist of more than one part. For example a hydrogen atom 
consists of a proton and an electron, and a diamond consists of a very large 
number of carbon atoms. In these examples of composite systems there is 
significant physical interaction between the component parts of the system 
- the electron moves in the electromagnetic field of the proton, and electro¬ 
magnetic forces act between the atoms in a diamond. But in principle there 
need be no physical interaction between the parts of a composite system: it is 
enough that we consider the sum of the parts to constitute a single system. 
For example ‘quantum cryptography’ exploits correlations between widely 
separated photons that are not interacting with each other, and in §7.5 we 
shall study a system that consists of two completely unconnected gyros that 
happen to be in the same box. Even in classical physics specifying the state 
of such a system is a complex business because in general there will be cor¬ 
relations between the parts of the system: the probability for obtaining a 
certain value for an observable of one subsystem depends on the state of the 
other subsystem. In quantum mechanics correlations arise through quantum 
interference between various states of the system, with the result that cor¬ 
relations are sometimes associated with unexpected and sometimes puzzling 
phenomena. 

In §6.1 we extend the formalism of quantum mechanics to composite 
systems. We introduce the concept of ‘quantum entanglement’, which is 
how correlations between the different parts of a composite system are rep¬ 
resented in quantum mechanics, and we find that subsystems have a propen¬ 
sity to become entangled. In §6.1.4 we discuss a thought experiment with 
entangled particles that Einstein believed demonstrated that quantum me¬ 
chanics is merely an incomplete substitute for a deeper theory. Experiments 
of this type have since been carried out and the results are inconsistent with 
a theory of the type sought by Einstein. In §6.2 we introduce the principal 
ideas of quantum computing, which is the focus of much current experimen¬ 
tal work and has the potential to revolutionise computational mathematics 
with major implications for the many aspects of our civilisation that rely 
on cryptography. In §6.3 we introduce the operator that enables us to drop 
unrealistic assumptions about our level of knowledge of the states of quan¬ 
tum systems and introduce the key concept of entropy. In §6.4 we show that 
thermodynamics arises naturally from quantum mechanics. In §6.5 we come 
clean about the intellectual black hole that lurks at the heart of quantum 
mechanics: the still unresolved problem of measurement. 
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At several points in the chapter we encounter fundamental questions 
about quantum mechanics with which experimental and theoretical physi¬ 
cists are currently wrestling. It is a remarkable feature of quantum mechanics 
that already the sixth chapter of an introduction to the subject can bring 
students to the frontier of human understanding. 


6.1 Composite systems 

Once we understand how to combine two systems A and B to make a com¬ 
posite system AB, we will be in a position to build up systems of arbitrary 
complexity, because we will be able to combine the system AB with some 
other system C to make a system ABC, and so on indefinitely. So we now 
consider what is involved in forming AB out of A and B. 

Suppose {| A; *)} and {|B; j)} are sets of states of A and B, respectively. 
Then the symbolic product |A;«)|B;j) is used to denote that state of the 
composite system in which A is in the state | A; i) and B is in the state |B; j): 
clearly this is a well defined state of AB. We express this fact by writing 

|AB; i,j) = |A;i)|B; j), (6.1a) 

where the label of the ket before the semicolon indicates what system is 
having its state specified, and the label after the semicolon enumerates the 
states. The Hermitian adjoint of equation (6.1a) is 

(AB;i, j\ = (A;i|(B;j|, ( 6 . 1 b) 

and we define the product of a bra and a ket of AB by the rule 

(AB; i', j'|AB; i,j) = <A;i'|A;*)<B;/|B;j>. ( 6 . 2 ) 


This rule is well defined because the right side is simply a product of two 
complex numbers. It is a physically sensible rule because it implies that the 
probability that AB is in the state i'j' is the product of the probability that 
A is in state i' and B is in state j': 

p(AB;^/) = |(AB;*',/|AB;f,j)l 2 = |(A;*'|A;f)| 2 |(B;/|B;j)| 2 

= P( A;i>(B;/). 1 j 

Any state of AB that like (6.2) can be written as a product of a state 
of A and a state of B is rather special. To see this, we consider the simplest 
non-trivial example, in which both A and B are two-state systems. Let |+) 
and |—) be the two basis states |A; z) of A and let |f) and ||) be the two 
basis states |B; j) of B - we shall call these the ‘up’ and ‘down’ states of B. 
We use these basis states to expand the states of the subsystems: 

|A) = a_|—} + a + |+) ; |B) = 6j.|4.) + 6 -|-|t), (6.4) 

so the state |AB) = | A) |B) of AB can be written 

|AB) = (a_|—} + o+|+)) (h\l) + ) /g 

= o_5||—)|4_) + )|t) + o+6|.|+)|4.) + a+&tl+)|t)- 

The coefficients in this expansion are the amplitudes for particular events 
- for example a-b 4 . is the amplitude that A will be found to be minus and 
B will be found to be down. From them we obtain a relation between the 
probabilities of finding A to be in its plus state and B to be either up or 
down: 

P +1 = l&tP 

p+i \h\ 2 ' 


( 6 . 6 ) 
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Now by Bayes’ theorem, the probability of finding B to be up given that A 
is plus is 


P( B;t|A;+) 


P +1 

p(A;+) 


P +1 

P+1 + P+l 


1 

i + m/m 


(6.7) 


With equation ( 6 . 6 ) this simplifies to 


p(B;t|A;+) = i+w- (6 ' 8) 

The key thing is that the right side of this expression makes no reference to 
subsystem A. Evidently, when the state |AB) of the composite system can 
be written as a product |A) |B) of states of the subsystems, the probability 
of finding B to be up is independent of the state of A. That is, the two 
subsystems are uncorrelated or statistically independent. Usually the 
states of subsystems are correlated and then the state of AB cannot be 
expressed as a simple product |A)|B). 

For example, suppose we have two vertical gear wheels, A with Aa 
teeth and B with Ab teeth. Then the state of A is specified by giving the 
amplitudes a,; that the z th tooth is on top of the wheel. The state of B is 
similarly specified by the amplitudes bj for each of its teeth to be uppermost. 
However, if both wheels are members of the same train of gears (as in a 
clock), the probability that the j th tooth of B is on top will depend on which 
tooth of A is uppermost. When the orientations of the wheels are correlated 
in this way, each of the AaAb configurations of the pair of wheels has an 
independent probability, pij. Specifically, when Aa = Ab, Pij will vanish 
except when i = j. If these gear wheels are uncorrelated because they are 
not meshed together, we need to specify only the Aa +Ab amplitudes a,; and 
bj. Once the wheels become correlated as a result of their teeth meshing, we 
have to specify AaAb amplitudes, one for each probability p. t j. 

We now assume that the sets {|A; z)} and {|B; j)} are complete for their 
respective systems and show that the set of states given by equation ( 6 . 1 a) 
for all possible values of i,j is then a complete set of states for the composite 
system. That is, any state |AB;^>) of AB can be written 


l AB ;V>) = 5Z c b'|A- B ;bj) = 5^Cii|A;i)| B ; j). (6.9) 

ij ij 


The proof involves supposing on the contrary that there is a state | AB; <j>) of 
AB that cannot be expressed in the form (6.9). We construct the object 

|AB;x) = |AB; <fi) - y^Cjj|AB;z, j) where Cy = (AB; i, j|AB; (j>). (6.10) 
ij 


This object cannot vanish or |AB; <^>) would be of the form (6.9). But when 
AB is in this state, the amplitude for subsystem A to be in any of the states 
|A;z) vanishes: 

^«A;z|<B;j|)|AB; X )=0. ( 6 - 11 ) 

3 

This conclusion is absurd because the set {|A;z)} is by hypothesis complete, 
so the hypothesised state | AB; <j>) cannot exist. Thus we have shown that a 
general state of AB is specified by AaAb amplitudes, just as the argument 
about gear wheels suggested. 

This result implies that the number of amplitudes required to specify the 
state of a composite system grows exceedingly rapidly with the complexity 
of the subsystems - for example, if Aa = Ab = 1000 , a million amplitudes 
are required to specify a general state of AB. By contrast only 2000 am¬ 
plitudes are required to specify a product state |AB) = |A)|B) because the 
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form of such a state automatically sets to zero all correlations between the 
subsystems. For a general state a large number of amplitudes are required 
to specify these correlations. 

Even when a state of AB is given by an expansion of the form (6.9) that 
involves N\Nb amplitudes, the states of A and B may not be correlated. To 
see this let |A ;i/j) = be the state of the subsystem A and let 

|B; <j>) = Y^f =i 6j|B ;j) be the state of subsystem B. Then the state of the 
composite system AB is 

|AB;x) = |A; V>)|B; (j>) = a ibj\A; z)|B; j). (6.12) 

ij 


The right side of this equation is identical to the right side of equation (6.9) 
except that c^- has been replaced with atbj. Thus equation (6.12) is an 
instance of the general expansion (6.9), but it is a very special instance: in 
general the expansion coefficients Cij , which can be thought of as the entries 
in an N\ x Nb matrix, cannot be written as the product of an iVA-dimensional 
vector with entries a; and an TVe-dimensional vector bj. To see that this is 
so, consider the ratio Cij /Cif of the matrix elements in the same row but 
different columns. When c,j can be expressed as the product of two vectors, 


we have 


c ij _ a ibj _ bj_ 

Cij' aibji bj' 


(6.13) 


so this ratio is independent of i. That is, when the state of AB can be written 
as the product of a state of A and a state of B, the expansion coefficients c^- 
are restricted such that every row of the matrix that they form is a multiple 
of the top row. Similarly, in this case every column is a multiple of the 
leftmost column (Problem 6.3). 

When the state of AB cannot be written as the product of a state of A 
and a state of B, we say that the subsystems A and B are entangled. As we 
have seen, the observables of entangled systems are correlated, so we could as 
well say that the subsystems are correlated. It is remarkable that correlations 
between subsystems, which are as evident in classical physics as in quantum 
mechanics, arise in quantum mechanics through the quintessentially quantum 
phenomenon of the addition of quantum amplitudes: states of AB in which 
subsystems A and B are correlated are expressed as linear combinations of 
states in which A and B are uncorrelated. The use of the word ‘entanglement’ 
reminds us that correlations arise through an intertwining of states that is 
inherently quantum-mechanical and without classical analogue. 

It may help to clarify these ideas if we apply them to a hydrogen atom. 
We work in the position representation, so we require the amplitude 


^(x e ,Xp) = (x e ,x p |^>) (6.14) 

to find the electron near x e and the proton near x p . Suppose that we have 
states 

«i(x e ) = (x e |uj) and Uj(x p ) = (x p | Uj) (6.15) 

that form complete sets for the electron and the proton, respectively. Then 
for any state of the atom, | if>), there are numbers such that 

ij 

Multiplying through by (x e ,x p | we obtain 


V>(x e ,x p ) = y 'cijUj(x e )Uj(xL p ). (6.17) 

ij 

The product of Ui and Uj on the right is no longer symbolic: it is an ordinary 
product of complex numbers. The quantity c,j is the amplitude to find the 
electron in the state \ui) and the proton in the state | Uj). 
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Box 6.1: Classical correlations 


It’s instructive to consider how we can represent correlations between 
two classical systems A and B. Let’s assume that each system has a 
finite number N of discrete states - they might be digital counters on an 
instrument panel. Then there are N 2 probabilities Cjk to specify. 

We can specify the state of A by giving N probabilities a 3 and 
similarly the state of B can be specified by probabilities b k ■ We might 
choose to express these in terms of their discrete Fourier transforms a a 
and bp, so 

JV-l JV-l 

a, = Y a a e 2 ™i' N ; b k = Y b ^ k/N 

o;—0 /3—0 


If A and B were uncorrelated, so Cj k = ajb k , the state of AB could be 
written 

c jk = Y d '*P e2nKaj+f>k)/N ’ (!) 

ck/3 

where 

da/3 dabft. (2) 

In the presence of correlations we can still represent Cj k as the double 
Fourier sum (1) but then c a p will not be given by the product of equation 
(2). Thus the mathematical manifestation of classical correlations can 
be very similar to quantum entanglement. The big difference is that in 
the classical case the expansion coefficients have no physical interpreta¬ 
tion: the basis functions used for expansion (here the circular functions 
e 2 maj/N ^ an( j |] ie expansion coefficients a a etc., will not be non-negative 
so they cannot be interpreted as probability distributions. In quantum 
mechanics these quantities acquire physical interpretations. Moreover, 
the final probabilities, being obtained by mod-squaring a sum like that 
of equation (1), involve quantum interference between different terms in 
the sum. 


6.1.1 Collapse of the wavefunction 

Consider again the composite system we introduced above in which both A 
and B are two-state systems, with |—) and |+) constituting a basis for A and 
||) and |t) constituting a basis for B. Let AB be in the entangled state 

|AB) = o|+)|t) + l~)(d|t) + c ll))> (6.18a) 

where b and c are given complex numbers. Then if a measurement of sub¬ 
system A is made and it yields +, the state of AB after the system is 

|AB) = |+)|t). (6.18b) 

Conversely, if the measurement of A yields —, the state of AB after the 
measurement is 

|AB) = 1 |-)(6|t) + c|l)). (6.18c) 

V\ b \ + |c| 2 

These rules are extensions of the usual collapse hypothesis, which we intro¬ 
duced in §idealmeasuresec: there we had a single system and we stated that 
when a measurement is made, the state of the system collapses from a linear 
combination of states that are each possible outcomes of the measurement to 
the particular state that corresponds to the value of the observable actually 
measured. That is 


\ijj) = Y a i\ i ) I'*/’) = |3), say. 


(6.19) 
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The new twist in equations (6.18) is that when we expand the state of a 
composite system as a linear combination of states of the subsystem we 
propose to measure, the coefficients of those states are states of the other 
subsystem rather than amplitudes, and these states are the ones the second 
system will be in after the first system has been measured. Consequently, the 
amplitudes we obtain for a subsequent measurement of the second subsystem 
depend on the outcome of the first measurement: if measurement of A yields 
+, then from (6.18b), a measurement of B is certain to yield f, while if the 
measurement of A yields —, subsequent measurement of B will yield j. with 
probability l/(|6/c| 2 + l). 


6.1.2 Operators for composite systems 

While the law of multiplication of probabilities leads to the kets of subsystems 
being multiplied, we add the operators of subsystems. For example, if A and 
B are both free particles, then the Hamiltonian operator of the composite 
system is 

H AB =H A + ff B = /^ + 7 fB_. ( 6 . 20 ) 

2m A 2m B 

In this simple example there is no physical interaction between the parts of 
the system, with the consequence that the Hamiltonian splits into a part 
that depends only on the operators of A, and a part that depends only on 
operators of B. When there is a physical connection between the systems, 
there will be an additional part of the Hamiltonian, the interaction Hamil¬ 
tonian that depends on operators belonging to both systems. For example, 
if both particles bear electrostatic charge Q , the interaction Hamiltonian 


Hint = 


Q 2 

47re 0 |x A - x B | 


( 6 . 21 ) 


should be added to H A + H B to form H AB . For the rest of this subsection 
we assume for simplicity that there is no dynamical interaction between the 
subsystems. 

When an operator acts on a ket that is a product of one describing A 
and one describing B, kets that belong to the other system stand idly by as 
if they were mere complex numbers. For example 

p B \A;i)\B;j) = |A;i)(p B |B; j)) (6.22) 


SO 

(A; j'\(H A + H B )\A; i)|B; j) = (A; *'|H a |A; i)<B;/|B; j) 

+ <A;*'|A;i)<B;/|H B |B;j) 

= {A; i'\H a \A; i)Sjj> + 5u> (B-, j'\H B \B-, j). 

(6.23) 

When we set i' = i and j' = j we obtain the expectation value of H AB when 
the system is in the state | A; i) |B; j). This is easily seen to be just the sum of 
the expectation values of the energies of the two free particles, as one would 
expect. 

We shall several times have to find the eigenvalues and eigenkets of an 
operator such as H AB that is the sum of operators H A and H B that belong to 
completely different subsystems. Every operator of subsystem A commutes 
with every operator of subsystem B. Consequently when H AB is given by 
equation (6.20), 

[H ab , H a ] = [H a + H B ,H A ] = 0. (6.24) 

That is, when there is no physical interaction between the subsystems, so 
Hab is just the sum of the Hamiltonians of the individual systems, H AB 
commutes with both individual Hamiltonians. It follows that in this case 
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there is a complete set of mutual eigenkets of Hab, Ha and Hr- Let {j A; *)} 
be a complete set of eigenkets of Ha with eigenvalues Ef-, and let (|B;i)} 
be a complete set of eigenkets of Hr with eigenvalues Ej. Then it is trivial 
to check that the states |AB;*,j) = |A;*)|B;j) are eigenkets of Hab with 
eigenvalues Ef- + U®. Moreover, we showed above that these product kets 
form a complete set. So the states |AB; ij) form a complete set of mutual 
eigenkets of Hab, Ha and Hb- In the position representation this result 
becomes the statement that the wavefunctions 

V^ B (x a ,x b ) = (x A , x b |AB; ij) = itf (x A )u B (x B ) (6.25) 

form a complete set of mutual eigenfunctions for the three operators. That 
is, if we have a composite system with a Hamiltonian that is simply the sum 
of the Hamiltonians of the parts, we can assume that the eigenfunctions of 
the whole system’s Hamiltonian are simply products of eigenfunctions of the 
individual component Hamiltonians. 

It is instructive to write the tdse for a composite system formed by two 
non-interacting subsystems: 

.*0|AB> . t 0, |AX|D „ .* ( d \ K )^\ , ,a\ S |B)^ 

lh — = lh di { |A)|B)) = lh + |A) ^rJ 

= (i? A |A))|B) + |A)(7 J b |B)) = (H a + fls)|A>|B> (6 ' 26) 
= H ab |AB). 

Thus we have been able to derive the tdse for the composite system from 
the TDSE for each subsystem. Notice that the physically evident rule for 
adding the Hamiltonians of the subsystem emerges as a consequence of the 
ket for the whole system being a product of the kets of the subsystems and 
the usual rule for differentiating a product. 


6.1.3 Development of entanglement 

Entangled is an appropriate name because subsystems are as prone to become 
entangled as is the line of a kite. To justify this statement, we consider the 
dynamical evolution of a composite system AB. Without loss of generality 
we can use basis states that satisfy the tdses of the isolated subsystems. 
That is, the we may assume that the states | A; i), etc, satisfy 

ih^^ = HA\A-i) and ih ' 1 = H B \B;j). (6.27) 

A general state of the composite system is 

|AB)=Ec iJ -|A;i)|B;j), (6.28) 

ij 

where the expansion coefficients Cij are all functions of time. The Hamilto¬ 
nian of the composite system can be written 

Hab = H A + H b + H int , (6.29) 

where the interaction Hamiltonian H lnt is the part of Hab that contains 
operators belonging to both subsystems (cf eq. 6.21). Substituting this ex¬ 
pression for Hab and the expansion (6.28) into the tdse for the composite 
system (eq. 6.26), we find 


i h 


9|AB) 

dt 


(%^ A; *>l B ;i) + c u 

iA ^ 


( d\A -i) 
^ dt 


|B;j) + | A; *) 


«>)} 


{(H a |A; *»|B; j) + |A; i)(H B \B-,j)) + // int |A; i)\B;j)} . 
ij 

(6.30) 
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After using equations (6.27) to cancel terms, this simplifies to 

=E c ^^|A;*)|B;i), (6.31) 

ij ij 

which states that the time evolution of the expansion coefficients is entirely 
driven by the interaction Hamiltonian. In particular, if there is no coupling 
between the systems (H lnt = 0), the dj are constant, so if the systems are 
initially unentangled, they remain so. 

By multiplying equation (6.31) through by (A;fc|(B;Z| we obtain an 
equation that is most conveniently written 

= y^Cij(AB;fc?|ff int |AB;ij). (6.32) 

ij 

Let’s suppose that all the matrix elements in this equation vanish except 
an element (AB; fco(o|Rint|AB; Wo) which lies on the diagonal. Then only 
Cfc 0 ; 0 will have non-vanishing time derivative, so the condition for the sub¬ 
systems to be unentangled, namely that Cy/Cy/ is independent of i, which 
is initially satisfied, will soon be violated by the ratio Ck 0 i 0 /ck 0 j for j Iq. 
Careful consideration of what happens when there are several non-vanishing 
matrix elements leads to the same conclusion: almost any coupling between 
subsystems will cause them to become entangled from an unentangled initial 
condition. 

This result is not surprising physically: a coupling makes the motion 
of one system dependent on the state of the other. So after some time the 
state that the second system has reached depends on the state of the first 
system, which is just to say that the two systems have become correlated or 
entangled. 


6.1.4 Einstein Podolski Rosen experiment 

In 1935 A. Einstein, B. Podolski and N. Rosen (EPR for short) proposed 1 
an experiment with entangled particles that they argued would demonstrate 
that quantum mechanics is an incomplete theory in the sense that to specify 
the state of a physical system you need to know the values taken by hidden 
variables that quantum mechanics does not consider. In 1964 J.S. Bell 
showed 2 that for a similar experiment quantum mechanics makes predictions 
that are incompatible with the existence of hidden variables. In 1972 an 
experiment of this type was successfully carried out 3 and its results were 
found to vindicate quantum mechanics. We now describe Bell’s formulation 
of the experiment and discuss its implications. 

A nucleus decays from a state that has no spin to another spinless state 
by emitting an electron and a positron. The nucleus is at rest both before and 
after the decay, so the electron and positron move away in opposite directions 
with equal speeds. As we saw in §1.3.5, electrons and positrons are spinning 
particles so they each carry some spin angular momentum away from the 
nucleus. Since the nucleus is at all times without angular momentum, the 
angular momenta of the electron and positron must be equal and opposite. 
At some distance from the decaying nucleus Alice detects the electron and 
measures the component of its spin in the direction of her choice, a. As 
we saw in §1.3.5, the result of this measurement will be either or —i. 
Meanwhile Bob, who sits a similar distance from the nucleus to Alice, detects 
the positron and measures its spin in the direction of his choice, b. 

After Alice has obtained +i on measuring the spin along a she thinks: 
“If Bob measures along a too, he must measure — i. But if Bob measures 

1 E. Einstein, B. Podolski & N. Rosen, Phys. Rev.. 47, 777 (1935) 

2 J.S. Bell, Phyics, 1, 195 (1964) 

3 S.J. Freedman & J.F. Clauser, Phys. Rev. L., 28, 938 (1972) 
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along some other vector b, I cannot be certain what value he will get, but he 
isn’t likely to get + ^ if b is only slightly inclined to my vector a, that is, if 
1 — a - b <C 1.” Alice can see that conservation of angular momentum implies 
that the results obtained by Bob and herself must be correlated. Let’s put 
this argument on a quantitative basis. 

In §7.5.1 we shall see that because the system formed by the electron- 
positron pair has no net angular momentum, its state can be written 

IV>) = -^2 d e +)l p_ ) “ l e -)l p +» • ( 6 - 33 ) 

Here |e+) is the state in which the component of the electron’s spin along 
the 2 -axis is certain to be +|, and similarly for |p±), etc. We are free to 
orient the 2 -axis parallel to Alice’s choice of direction a, so we do this. When 
Alice obtains +4, she collapses the system’s state into 

W) = |e+)|p->- (6-34) 

Before Alice’s measurement, when the state was given by equation (6.33), 
the amplitude for a measurement of the positron’s spin along a to yield + 4 
was 1 /\/2, but after the measurement equation (6.34) shows that it vanishes, 
just as Alice reasoned it would. To find the amplitude for Bob to measure for 
the positron +5 along another vector b, we recall equation equation (1.34)a 
from §1.3.5: 


|+,b) = sin(0/2)e 1 ^ 2 |p—) + cos(0/2)e “^ 2 |p+), (6.35) 

where 9 and <j) are the polar angles that give the orientation of b in a system 

in which a is along the 2 -axis. In particular 

cosd = a b. (6.36) 

Given that after Alice’s measurement the positron is certainly in the state 
|p—), it follows from equation (6.35) that the amplitude for Bob to measure 
along his chosen direction is (+,b|p—) = sin(0/2)e -1<?i / 2 . Mod-squaring 
this amplitude we find that the probability that Bob measures +\ is 

Pb(+|A+) =sin 2 (d/2), (6.37) 

which is small when a ~ b as Alice predicted. So quantum mechanics is 
consistent with common sense. 

We have supposed that Alice measures first, but if the electron and 
positron are moving relativistically, a light signal sent to Bob by Alice when 
she made her measurement would not have arrived at Bob when he made his 
measurement, and vice versa. In these circumstances the theory of relativity 
teaches us that the order in which the measurements are made depends on 
the velocity of the observer who is judging the matter. Consequently, for 
consistency the predictions of quantum mechanics must be independent of 
who is supposed to make the first measurement and to collapse the system’s 
state. It is easy to see from the discussion above that this condition is 
satisfied. 

What worried EPR was that after Alice’s measurement there is a di¬ 
rection in which Bob will never find + 4 for the positron’s spin, and this 
direction depends on what direction Alice chooses to use. This fact seems 
to imply that the positron somehow ‘knows’ what Alice measured for the 
electron, and the collapse of the system’s state from (6.33) to (6.34) seems 
to confirm this suspicion. Since relativity forbids news of Alice’s work on 
the electron from influencing the positron at the time of Bob’s measurement, 
EPR argued that the required information must have travelled out with the 
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positron in the form of a hidden variable which was correlated at the time 
of the nuclear decay with a matching hidden variable in the electron. 

The existence of hidden variables would explain the probabilistic nature 
of quantum mechanics (which Einstein intensely disliked) because the uncer¬ 
tain outcomes of experiments would reflect our ignorance of the values taken 
by the hidden variables; the uncertainty would be banished once a better 
theory gave us access to these variables. 

Bell’s inequality Remarkably, Bell was able to show that any hidden 
variable theory will yield a weaker correlation than quantum mechanics be¬ 
tween the measurements of Alice and Bob as functions of the angle 6 between 
their chosen directions. Let’s denote the results of Alice’s and Bob’s mea¬ 
surements by (Ta = ±5 and <tb = ±4 and calculate the expectation value 
of the product ctactb- There are just four cases to consider, so the desired 
expectation value is 

(ctactb) = 3;{Pa(+)Pb(+|A+) + Pa(—)Pb(—|A—) 

-Pa(+)Pb(—|A+)-P a (—)Pb(+|A—)}, j 

where Pa(+) is the probability that Alice obtains a\ = and Pb(— |A+) 
is the probability that Bob finds ctb = — \ given that Alice has measured 
a a = + 5 - Since nothing is known about the orientation of the electron 
before Alice makes her measurement 

Pa(+) = Pa(—) = i (6.39) 

We showed above (eq. 6.37) that Pb(+|A+) = sin 2 (0/2), so 

P b (-|A+) = 1 - P b (+|A+) = cos 2 (0/2). (6.40) 

Putting these results into equation (6.38) we have 

(ct A (7b) = |{sin 2 (0/2) - cos 2 (0/2)} = cos0 = -|a • b, (6.41) 

which agrees with Alice’s simple argument when a — ±b. 

Consider now the case that the result of measuring the electron’s spin 
in the direction a is completely determined by the values taken by hidden 
variables in addition to a. That is, if we knew the values of these variables, 
we could predict with certainty the result of measuring the component of 
the electron’s spin in the direction of any unit vector a and Alice is only 
uncertain what result she will get because she is ignorant of the values of 
the hidden variables. We consider the variables to be the components of 
some n-dimensional vector v, and have that the result of measuring the 
electron’s spin along a is a function cr e (v,a) that takes the values ±-( only. 
Similarly, the result of measuring the positron’s spin along a unit vector b 
is a function er p (v, b) that is likewise restricted to the values ±^. As Alice 
argued, conservation of angular momentum implies that 

cr e (v,a) = -cr p (v, a). (6.42) 

The outcome of a measurement is uncertain because the value of v is uncer¬ 
tain. We quantify whatever knowledge we do have by assigning a probability 
density p(v) to v, which is such that the probability that v lies in the in¬ 
finitesimal n-dimensional volume d"v is dP = p(v) d"v. In terms of p the 
expectation value of interest is 


(cr e (a)cTp(b)) = j d”vp(v)cr e (v,a)cr p (v,b) 

= -/d»vp(vK(v,aK(,b), 


(6.43) 



114 


Chapter 6: Composite systems 



Figure 6.1 For a family of choices 
of the vectors a, b and b', quantum 
mechanics predicts that the left side 
of Bell’s inequality (6.46) is larger 
than the right side, contrary to the 
prediction of any hidden-variable 
theory. 


where the second equality uses equation (6.42). 

Now suppose Bob sometimes measures the spin of the positron parallel 
to b' rather than b. Then the fact that of (v, b) = j allows us to write 


(cr e (a)CTp(b)) - (cre(a)crp(b')) = ^/d"vp(vVe(v,a)K(v.b) - cr e (v,b')} 

= ~ J d n vp(v)cr e (v,a)cr e (v, b){l - 4cr e (v,b)cr e (v, b')}. 

(6.44) 

We now take the absolute value of each side and note that the curly bracket 
in the integral is non-negative, while the product cr e (v, a)cr e (v, b) in front of 
it fluctuates between ±-j. Hence we obtain an upper limit on the value of 
the integral by replacing cr e (v, a)<r e (v, b) by j, and have 

|( 0 ’e(a)cTp(b)) — (<r e (a)(7 p (b , )}| < ± J d"v p(v){l - 4cr e (v, b)cr e (v, b')}. 

(6.45) 

We break the right side into two integrals. The first, f d n vp(v), evaluates 
to unity because p is a probability density, while changing b — > b' and 
a — > b in equation (6.43) we see that the the second integral evaluates to 
—d^e^o^b')). Hence we have that 

|(<7e(a)<7 p (b)) - (cr e (a)cr p (b , )}| < ± + (cr e (b)cr p (b')). (6.46) 

This is Bell’s inequality, which must hold for any three unit vectors a, b 
and b' if hidden variables exist. It can be tested experimentally as follows: 
for a large number of trials Alice measures the electron’s spin along a while 
Bob measures the positron’s spin along b in half the trials and along b' in 
the other half. From the results of these trials the value of the left side 
of equation (6.46) can be estimated. The value of the right side is then 
estimated from a new series of trials in which Alice measures the electron’s 
spin along b and Bob measures the positron’s spin along b'. 

An obvious question is whether Bell’s inequality is consistent with the 
quantum-mechanical result (cr e (a)cr p (b)) = —|a • b (eq. 6.41). When we 
substitute this expression into each side we get 

LHS = j|a • (b — b')| ; rhs = \(l - b • b'). (6.47) 

Let’s choose ab = 0 and b' = b cos </>+asin (f> so as we increase the parameter 
tp from zero to n/2 b' swings continuously from b to a. For this choice of b' 
we easily find that 


LHS = sin</>| j RHS = j(l — cos <f>). (6.48) 


These expressions for the left and right sides of Bell’s inequality are plotted 
in Figure 6.1: we see that the inequality is violated for all values of <j) other 
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than 0 and 7 r/ 2 . Thus the quantum-mechanical result is inconsistent with 
Bell’s inequality and is therefore inconsistent with the existence of hidden 
variables. 

Inequalities similar to (6.46) can be derived for systems other than spin- 
half particles, including pairs of entangled photons. Experiments with pho¬ 
tons have produced results that agree with the predictions of quantum me¬ 
chanics to sufficient precision that they violate the relevant Bell inequalities . 4 
Consequently, these experiments rule out the possibility that hidden variables 
exist. 

What general conclusions can we draw from the EPR experiment? 

• A measurement both updates our knowledge of a system and disturbs the 
system. Alice’s measurement disturbs the electron but not the positron, 
and gains her information about both particles. 

• Quantum mechanics requires wholistic thinking: when studying the 
EPR experiment we must consider the system formed by both parti¬ 
cles together rather than treating the particles in isolation. We shall 
encounter a more spectacular example of this requirement below in con¬ 
nection with ideal gases. 

• Many discussions of the EPR experiment generate needless confusion 
by supposing that after Alice has measured + 5 for the component of 
the electron’s spin parallel to a, the spin is aligned with a. We shall 
see in §7.4.2 that the electron also has half a unit of angular momentum 
in each of the x and y directions, although the signs of these other 
components are unknown when we know the value of s z . Hence the 
most Alice can know about the orientation of the spin vector is that it 
lies in a particular hemisphere. Whatever hemisphere Alice determines, 
she can argue that the positron’s spin lies in the opposite hemisphere. 
So if Alice finds the electron’s spin to lie in the northern hemisphere, 
she concludes that the positron’s spin lies in the southern hemisphere. 
This knowledge excludes only one result from the myriad of possibilities 
open to Bob: namely he cannot find s z = +^. He is unlikely to find + 1 
if he measures the component of spin along a vector b that lies close to 
the 3 axis because the hemisphere associated with this result has a small 
overlap with the southern hemisphere, but since there is an overlap, the 
result +5 is not excluded. Contrary to the claims of EPR, the results of 
Bob’s measurements are consistent with the hemisphere containing the 
positron’s spin being fixed at the outset and being unaffected by Alice’s 
measurement. 

• The experimental demonstration that Bell inequalities are violated es¬ 
tablishes that quantum mechanics will not be superseded by a theory 
in which the spin vector has a definite direction. In §7.4.1 we shall see 
that macroscopic objects only appear to have well defined orientations 
because they are not in states of well-defined spin. That is, the idea 
that a spin vector points in a well defined direction is a classical notion 
and not applicable to objects such as electrons that do have a definite 
spin. This idea is an old friend from which we part company as sadly as 
after studying relativity we parted company with the concept of univer¬ 
sal time. The world we grew accustomed to in playgroup is not the real 
world, but an approximation to it that is useful on macroscopic scales. 
The study of physics forces one to move on and let childish things go. 


4 e.g., W. Tittel et al., PRL, 81, 3563 (1998) 
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6.2 Quantum computing 

There’s an old story about a mathematician at the court of the Chinese em¬ 
peror. The mathematician had advised the emperor wisely and the emperor, 
wishing to express his gratitude in a manner worthy of his greatness, asked 
the mathematician to name the reward he would like to receive. “Oh great 
Emperor, your offer is too liberal for one who has rendered you such a slight 
service. Let a chess board be brought and one grain of rice be placed on the 
first square, two on the second, four on the third, eight on the fourth, and 
so on till every square of the board has received an allocation of rice.” The 
emperor was pleased by the modesty of the mathematician’s proposal and 
ordered it be done. Great was his shock and annoyance the next day when it 
was reported to him that all the rice in his great silos had proved insufficient 
to pay the mathematician his due. For 2 64 — 1 ~ 10 19 grains of rice would 
be needed to supply the 64 squares on the board. That’s ~ 10 12 tons of rice 
and vastly more than all the rice on the planet. 5 

What is the relevance of this old story for quantum mechanics? We have 
seen that a system made of two two-state systems has four basis states. If we 
add a further two-state system to this four-state composite system, we obtain 
a system with 2x4 = 8 basis states. By the time we have built a system 
from 64 two-state systems, our composite system will have 2 64 ~ 10 19 basis 
states. Sixty four two-state systems might be constructed from 64 atoms 
or even 64 electrons, so could be physically miniscule. But to calculate the 
dynamics of this miniscule system we would have to integrate the equations 
of motion of 10 19 amplitudes! This is seriously bad news for physics. 

The idea behind quantum computing is to turn this disappointment 
for physics into a boon for mathematics. We may not be able to solve 
10 19 equations of motion, but Nature can evolve the physical system, and 
appropriate measurements made on the system should enable us to discover 
what the results of our computations would have been if we had the time to 
carry them out. If this approach to computation can be made to work in 
practice, calculations will become possible that could never be completed on 
a conventional computer. 

The first step towards understanding how a quantum computer would 
work is to map integers onto the basis states of our system. In this context 
we refer to a two-state system as a qubit and call its basis states |0) and 
|1). A set of N qubits forms a register, which has a complete set of states 
of the form |cc) |cc') • • • \x"), where x, x' , etc., = 0,1 indicate the states of the 
constituent qubits. Now given a number in binary form, such as 7 = 4 + 2 + 
1 = 111, we associate it with the basis state of the register |0)... |0)11)11)11). 
In this way we establish a one to one correspondence between the integers 
0 to 2 n — 1 and the basis states of a register that comprises N qubits. We 
use this correspondence to establish a more compact notation for the basis 
states of the register, writing |7) instead of |0)... 10) 11) 11) 11), etc. 

This arrangement mirrors the correspondence in a classical computer 
between numbers and the states of a classical register formed by N classical 
two-state systems or bits. The crucial difference between quantum and 
classical registers is that whereas a classical register is always in a state that 
is associated with a definite number, the generic state | %p) of a quantum 
register is a linear combination of states that are associated with different 
numbers: 

2 N -i 

IV*) = c i !•?')• ( 6 - 49 ) 

3-0 

Thus nearly all states of a quantum register are not associated with individ¬ 
ual numbers but with all representable numbers simultaneously. We shall see 
that this ability of a single state of a quantum register to be associated with 

5 According to the International Rice Search Institute, in 2007 global rice production 
was 650 million tons. 
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a huge number of integers enables a quantum computer to conduct massively 
parallel computations. 

The central processor unit (CPU) of a classical computer is a pro¬ 
grammable mechanism that reads a number n from an input register and 
places the number f(n) into the output register, where / is the function 
that the CPU is currently programmed to evaluate. By analogy one might 
imagine that a quantum computer would consist of a quantum register and 
a programmable Hamiltonian H that would cause the state | n) to evolve in 
some specified time T into the state | fin)) = e~ lHT / n \n). Unfortunately this 
conception is flawed because this machine could not evaluate any function 
that took the same value on different arguments, so f(n) = f(m) = F, say, 
for some values n ^ m. To see why the computer could not evaluate such a 
function recall that the operator U = e~ lHT / h is unitary, so it has an inverse 
U U But we have U\n) = U\m) = |F), and if we apply U' to | F) we must 
get both \m) and |n), which is absurd. 

We get around this problem by making our quantum computer slightly 
more complex: we let it have two registers, a control register X and a 
data register Y. The computer then has a basis of states |x)|y), where x is 
the number stored in the control register and y is the number stored in the 
data register. We conjecture that we can find a Hamiltonian such that for 
any function / the state |x)|y) evolves in time T into the state \x)\y + f(x)). 
Adding the second register solves the problem we encountered above because 
applying U to \n)\y) we get \n)\y + F) which is a different state from what we 
get when we apply U to \m)\y), namely \m)\y + F): adding the extra register 
allows the computer to remember the state it was in before the machine 
cycle started, and this memory makes it logically possible for U' to restore 
the earlier state. 

Adding the second register may have demolished an objection to our 
original most naive proposal, but is it really possible to construct a time- 
evolution operator that would enable us to evaluate any function f{x)l This 
question is answered affirmatively in two stages. First one defines a handful 
of unitary operators U that perform basic bit manipulations on our registers, 
and shows that using a sequence of such operators one can perform any of 
the standard arithmetical operations, adding, subtracting, multiplying and 
dividing. Second, for each of these operators U one designs an experiment 
in which U gives the evolution of a two-state quantum system over some 
time interval. Currently many groups use photons as qubits, identifying 
|0) and |1) with either right- and left-handed circular polarisation, or with 
linear polarisation in two orthogonal directions. Other groups use electrons 
as qubits, identifying |0) and |1) as states in which the spin in some given 
direction is either i or — i. All such work with real qubits is extremely 
challenging and in its infancy, but it has already established that there is no 
objection in principle to realising the simple unitary operators that quantum 
computing requires. It is too early to tell what physical form qubits will take 
when quantum computing becomes a mature technology. Consequently, we 
leave to one side the question of how our operators are to be realised and 
focus instead on what operators we require and what could be achieved with 
them when they have been realised. 

The simplest computer has two one-qubit registers, with a basis of states 
|0)|0), |0)11), 11)|0) and 11)11) - we shall refer to basis states of a register with 
any number of qubits ordered thus by increasing value of the stored number 
as the computational basis. In the computational basis of our two-qubit 
system, the operator [7+ that performs addition (|x)|j/) —> |x)|j/-l-x)) has the 
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unitary matrix 6 


U + = 


(\ 0 0 O' 
0 10 0 
0 0 0 1 
Vo o i o, 


To justify this claim we note that 


/! 

0 

0 

°\ 

(a\ 

(a\ 

0 

1 

0 

0 

P \ = 

P\ 

0 

0 

0 

1 

7 I 

6 

Vo 

0 

1 

0/ 

\s) 

V7 / 


so U + causes the state of the computer 

\ip) = a|0}|0) + /?|0)|1) + 7|1)|0) + <5|1}|1) 


(6.50) 


(6.51) 


(6.52) 


to evolve into 


u+ IV’) = «|0)|0) + /3|0)|1) + 7 |1)|1) + 5|1)|0), (6.53) 


so the second qubit is indeed incremented by the first modulo 2. 

17+ is a simple example of an operator in which the state of the data 
register is changed in a way that depends on the state of the control register 
while the state of the control register stays the same. Such operators are 
called controlled-U operators. Another useful operator is the controlled- 
phase operator, which in the computational basis has the matrix 


U4, 


n 0 0 0 \ 
01001 
0010 
Vo 0 0 e^J 


( 6 . 54 ) 


Utf, has no effect on the first three states of the computational basis, and it 
multiplies the phase of the last state by e 1<?i . It is straightforward to show 
that 

U<j>\x)\y) =e lxv<t, \x)\y) (6.55) 

by checking that the two sides match for all four possible values of (x,y). 

It can be shown that any unitary transformation of an n-qubit register 
can be simulated if we augment 17+ and U$ with two operators that work 
on just one qubit. One of these extra operators is the phase operator 17^, 
which leaves |0) invariant and increments the phase of |1) by <f>: 


K\0) = | 0 > 1 

^|l)=e i 1l) j 




Ul\x)=e^\x) 




Ul = 


1 0 
0 e‘ 


4 • ( 6 - 56 ) 


The other single-ciubit operator that we need is the Hadamard operator, 
which in the computational basis, |0) 11), has the matrix 


C/„ = T(j (6.57) 

The Hadamard operator takes a state that represents a number, such as 10), 
and turns it into a state that is a linear combination of the two representable 
numbers: Uh| 0) = (|0) + \l))/s/2. Conversely, because 17^ = I so 17 h is 

6 Here x + y must be understood to mean x + y mod 2 because quantum computers 
like classical computers do arithmetic modulo one more than the largest number that they 
can store. 
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Figure 6.2 Schematic diagram to show how two Hadamard operators and two phase shift 
operators suffice to transform |0) into an arbitrary state of a qubit. 
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Figure 6.3 Evaluating / on every 
argument simultaneously. The top 
three qubits form the control reg¬ 
ister, which is initially in the state 
| 0 >. 


its own inverse, it turns these linear combinations of numbers into actual 
numbers: Uh (| 0 ) + | l ))/-/2 = | 0 ). 

Complex operations on qubits can be built up by sequences of phase and 
Hadamard operators and such sequences are conveniently described using 
the graphical notation of Figure 6.2. Each qubit is represented by a line 
along which the state of the qubit flows from left to right. In the simple 
example shown, the state |0) is converted by the first Hadamard operator to 
(|0) + 11))/\/2, and U^g converts this to 

_L(| O ) + e 2i0 |l». (6.58a) 

After the next Hadamard operator this becomes 

i (|0) + |1) + e 2i6> (|0) - |1») = H (1 + e 2 *) |0> + (1 ^ e 2i0 

= e 10 (cos0|O) — isin#|l)} . 

Finally, application of the phase-shift operator t/J +7r / 2 converts this to 

|t/>) = e 10 (cos0|O) + e 1 ^ sin0|l)) . (6.58c) 

By choosing the values of 6 and <f> appropriately, we can make \ip) any chosen 
state of the qubit. Thus the phase-shift and Hadamard operators form a 
complete set of single-qubit operators. 

If we apply a Hadamard operator to each qubit of an 2-qubit register 
that is initially in the state |0)|0), we get 

(C/H|0))(f/ H |0)) = i(|0) + |l))(|0) + |l)) 

= \ (11)11) + 11)|0) + |0)|1) + |0)|0)) (6.59) 

= 3(|3) + |2> + |1) + |0)). 

That is, by setting the register to zero and then applying a Hadamard oper¬ 
ator to each of its qubits, we put the register into a linear superposition of 
the states associated with each representable number. It is easy to see that 
this result generalises to n-qubit registers. 7 Using this trick we can simul¬ 
taneously evaluate a function on every representable argument, simply by 
evaluating the function on the state of the control register immediately after 
it has been processed by the Hadamard operators. Figure 6.3 illustrates this 
process, which is described by the equations 

7 In fact, applying Hadamard operators to the qubits of an n-qubit register when it is 
set to any number will put the register into a linear superposition of states associated with 
all representable numbers, but if the initial state of the register differs from 10), exactly 
half of the coefficients in the sum will be — 2 -71 / 2 and half +2 _n / 2 (Problem 6.6). 


Ill)} 


(6.58b) 
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Box 6.2: Deutsch’s algorithm 

Given a function f[x) that takes an n-bit argument and returns either 0 
or 1 , the exercise is to determine whether / is a constant or ‘balanced’ 
function. To this end we build a computer with an n-qubit control regis¬ 
ter and a single qubit data register. We set the control register to |0) and 
the data register to |1) and operate on every qubit with the Hadanrard 
operator [7 h- Then the computer’s state is 

2^(eV>)(|0}-|1». (1) 

' x=0 ' 

Now we evaluate the function / in the usual way, after which the com¬ 
puter’s state is 

^fei*><i/(*)>-|i+ /(*>>>)- (2) 

Given that f(x) = 0,1, it is straightforward to convince oneself that 
(| f{x)) — |1 + /(#))) = (— 1 )/C a; ) (| 0 ) — | 1 )) so the computer’s state can be 
written 

fe(- 1 > ,W l*>) (l»> - |!»' (3) 

We now operate on every qubit with Un for a second time. The data 
register returns to | 1 ) because Un is its own inverse, while the control 
register only returns to | 0 ) if we can take the factor (—l)/( x ) out of 
the sum over x, making the state of the control register a multiple of 
\ x )\ if / is ‘balanced’, half of the factors (—l)/( x ) are +1 and half —1 
and in this case Un moves the control register to a state \y) for y ^ 0 
(Problem 6 . 6 ). Hence by measuring the state of the control register, we 
discover whether / is constant or balanced: if the control register is set 
to zero, / is constant, and if it holds any other number, / is balanced. 


I°}I°}^^T 72 X Ml") ^ Aj E l*>l/M>- ( 6 - 60 ) 

x—0 x=0 

After the evaluation of /, the computer’s state depends on every possible 
value of /. So the state of a 64-qubit computer will depend on the 2 64 ~ 
10 19 possible values of /. By exploiting this fact, can we conduct massively 
parallel computations with just a pair of quantum registers? 

The question is, how can we learn about the values that / takes? An 
obvious strategy is to read off a numerical value X from the control register 
by collapsing each of its qubits into either the state | 0 ) or the state | 1 ). 
Once this has been done, the state of the composite system \x)\y) will have 
collapsed from that given on the right of (6.60) to \X)\f(X)), so f(X) can 
be determined by inspecting each of the qubits of the data register. The 
trouble with this strategy is that it only returns one value of /, and that for 
a random argument X. Hence if our quantum computer is to outperform a 
classical computer, we must avoid collapsing the computer’s state by reading 
its registers. Instead we should try to answer questions about / that have 
simple answers but ones that involve all the values taken by /. 

For example, suppose we know that f(x) only takes the values 0 and 1, 
and that it is either a constant function (i.e., either f(x) = 1 for all x, or 
f(x) =0 for all a:) or it is a ‘balanced function’ in the sense that f{x) = 0 for 
half of the possible values of x and 1 for the remaining values. The question 
we have to answer is “is / constant or balanced?” With a classical computer 
you would have to keep evaluating / on different values of x until either you 
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got two different values (which would establish that / was balanced) or more 
than half of the possible values of x had been tried (which would establish 
that / was constant). In Box 6.2 we show that from (6.60) we can discover 
whether / is constant or balanced with only a handful of machine cycles. 

The algorithm given in Box 6.2 is an extension of one invented by 
Deutsch 8 , which was an early example of how the parallel-computing po¬ 
tential of a quantum computer could be harnessed. Subsequently algorithms 
were developed that dramatically accelerate database searches 9 and the de¬ 
composition of large numbers into their prime factors. The usefulness of 
the internet depends on effective cryptography, which currently relies on 
the difficulty of prime-number decomposition. Hence by rendering existing 
cryptographic systems ineffective, the successful construction of a quantum 
computer would have a big impact on the world economy. 

Notwithstanding strenuous efforts around the world, quantum comput¬ 
ing remains a dream that will not be realised very soon. Its central idea is 
that the the integers up to 2 ^ — 1 can be mapped into the base states of an 
A r -qubit quantum register, so a general state of such a register is associated 
with all representable integers, and the time evolution of the register involves 
massively parallel computing. The field is challenging both experimentally 
and theoretically. The challenge for theorists is to devise algorithms that 
extract information from a quantum register given that any measurement 
of the register collapses its state and thus erases much of the information 
that was encoded in it before a measurement was made. Experimentally, the 
challenge is to isolate quantum registers from their environment sufficiently 
well that they do not become significantly entangled with the environment 
during a computation. We discuss the process of becoming entangled with 
the environment in the next section. 


6.3 The density operator 

To this point in this book we have assumed that we know what quantum 
state our system is in. For macroscopic objects this assumption is completely 
unrealistic, for how can we possibly discover the quantum states of the ~ 10 23 
carbon atoms in a diamond, or even the ~ 10 5 atoms in a protein molecule? 
To achieve this goal for a diamond, at least 10 23 observables would have 
to be measured, and the number would in reality be vastly greater because 
individual atoms would be entangled with one another, making the state of 
the diamond a linear combination of basis states of the form |ai)|a 2 )... |ajv)> 
where |a*) denotes a state of the z th atom. It is time we squared up to the 
reality of our ignorance of the quantum states of macro- and meso-scopic 
objects. 

Actually, we need to be cautious even when asserting that we know the 
quantum state of atomic-scale objects. The claim that the state of a system 
is known is generally justified by the assertion that a measurement has just 
been made, with the result that the system’s state has been collapsed into 
a known eigenstate of the operator of the given observable. This procedure 
for establishing the quantum state of a system is unrealistic in that it makes 
no allowance for experimental error, which we all know to be endemic in real 
laboratories: real experiments lead to the conclusion that the value of an 
observable is x ± y, which is shorthand for “the probability distribution for 
the value of the observable is centred on x and has a width of the order y.” 
Since the measurement leaves the value of the observable uncertain, it does 
not determine the quantum state precisely either. 

Let us admit that we don’t know what state our system is in, but conjec¬ 
ture that the system is in one of a complete set of states (|n)}, and for each 

8 D. Deutsch, Proc. R. Soc., 400, 97 (1985) 

9 L. K. Grover, STOC’96, 212 (1996) 
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value of n assign a probability p n that it’s in the state |n). 10 It’s important 
to be clear that we are not saying that the system is in the state 

\4>) = (6.61) 


That is a well-defined quantum state, and we are admitting that we don’t 
know the system’s state. What we are saying is that the system may be in 
state 11), or in state |2), or state |3), and assigning probabilities p\, p 2 , ■ ■ ■ 
to each of these possibilities. 

Given this incomplete information, the expectation value of measuring 
some observable Q will be pi times the expectation value that Q will have 
if the system is in the state 11), plus p 2 times the expectation value for the 
case that the system is in the state |2), etc. That is 


Q = ^2p n (n\Q\n), 

n 


(6.62) 


where we have introduced a new notation Q to denote the expectation value 
of Q when we have incomplete knowledge. When our knowledge of a system 
is incomplete, we say that the system is in an impure state, and corre¬ 
spondingly we sometimes refer to a regular state |-i/>) as a pure state. This 
terminology is unfortunate because a system in an ‘impure state’ is in a per¬ 
fectly good quantum state; the problem is that we are uncertain what state 
it is in - it is our knowledge of the system that’s impure, not the system’s 
state. 

It is instructive to rewrite equation (6.62) by inserting either side of Q 
identity operators I = JT \qj){qj\ that are made out of the eigenkets of Q. 
Then we have 

Q = '52pn(n\qk){qk\Q\qj)(qj\n) = ^ qjPn\{qj\n)\ 2 , (6.63) 

nkj nj 


where the second equality follows from Q\qj) = qj\qj) and the ortho normality 
of the kets | qj). Equation (6.63) states that the expectation value of Q is the 
sum of the possible measurement values qj times the probability p n \(qj\n )\ 2 
of obtaining this value, which is the product of the probability of the system 
being in the state | n) and the probability of obtaining qj in the case that it 
is. 

Now consider the density operator 

p = ^2p n \ n ){n\, (6.64) 


where the p n are the probabilities introduced above. This definition is rem¬ 
iniscent of the definition 

Q = Y^qj\Qj)(Qj\ (6-65) 

3 

of the operator associated with an observable (eq. 2.9). In particular, p is 
a Hermitian operator because the p n are real. It should not be considered 
an observable, however, because the p n are subjective not objective: they 
quantify our state of knowledge rather than hard physical reality. For exam¬ 
ple, if our records of the results of measurements become scrambled, perhaps 
through some failure of electronics in the data-acquisition system, our values 
of the p n will change but the system will not. By contrast the spectrum {qj} 


10 See Problem 6.9 for a different and more physically plausible physical assumption. 
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Box 6.3: Properties of Tr 


The trace operator Tr extracts a complex number from an operator. We 
now show that although its definition (6.67) is in terms of a particular 
basis {| to)}, its value is independent of the basis used. Let (|g_j)} be any 
other basis. Then we insert identity operators I = JT \qj){qj\ either side 
of A in Tr A = ^ n (n|4|fi) : 

TrA = ^(n\q j )(q j \A\q k )(q k \n) = y ^2(q j \A\q k ){q k \n){n\q j ) 

nik kin , . 

(!) 

= Y^ioj\ A \qi)> 

3 

where we have used I = J2 n l n )( n l and ( qk\qj ) = 5kj- 

Another useful result is that for any two operators A and B , 
Tr(AB) = Tr (BA): 


Tr(Ai?) = ^^(n|AB|n) = ^^(n\A\m)(m\B\n) 

n nm 

= y^(m|i?|n)(n|A|TO) = m\BA\m) = Tr(i?A). 

nm m 


( 2 ) 


By making the substitutions B —>• C and A —> AB in this result we infer 
that 


Tr(ABC) = Tr(CAB). (3) 


of Q is determined by the laws of nature and is independent of the complete¬ 
ness of our knowledge. Thus the density operator introduces a qualitatively 
new feature into the theory: subjectivity. 

To see the point of the density operator, we use equations (6.64) and 
(6.65) to rewrite the operator product pQ: 

PQ = '52Pnqj\n)(n\q j )(q j \. (6.66) 

nj 


When this equation is premultiplied by (to| and postmultiplied by | to) and 
the result summed over ?n, the right side becomes the same as the right side 
of equation (6.63) for Q. That is, 

Tr (pQ) = ^2(m\pQ\m) = Q, (6.67) 

m 

where ‘Tr’ is short for ‘trace’ because the sum over to. is of the diagonal ele¬ 
ments of the matrix for pQ in the basis (|u)}. Box 6.3 derives two important 
properties of the trace operator. 

Equation (6.64) defines the density operator in terms of the basis (|n)|. 
What do we get if we express p in terms of some other basis {| Q'y)} ? To find 
out we replace \n) by ^2j(qj\n)\qj) and obtain 


P = Pn{qj\n){n\q k ) \q 0 )(qk\ 

njk 

= '^2Pjk\qj)(qk\ where p jk = '^ i p n (q j \n)(n\q k ). 

jk n 


( 6 . 68 ) 


This equation shows that whereas p is represented by a diagonal matrix in 
the {]«■)} basis, in the {|gj)} basis p is represented by a non-diagonal matrix. 
This contrast arises because in writing equation (6.64) we assumed that our 
system was in one of the states of the set (|u)}, although we were unsure 
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which one. In general if the system is in one of these states, it will definitely 
not be in any of the states (1^)} because each \n) will be a non-trivial 
linear combination of kets | qj). Thus when p is expanded in this basis, the 
expansion does not simply specify a probability to be in each state. Instead 
it includes complex off-diagonal terms pjk = \ n )( n \9k) that have no 

classical interpretation. When we have incomplete knowledge of the state of 
our system, we will generally not know that the system is in some state of a 
given complete set, so we should not assume that the off-diagonal elements of 
p vanish. Never the less, we may safely use equation (6.64) because whatever 
matrix represents p in a given basis, p is a Hermitian operator and will have 
a complete set of eigenkets. Equation (6.64) gives the expansion of p in 
terms of its eigenkets. In practical applications we may not know what the 
eigenkets | n) are, but this need not prevent us using them in calculations. 

The importance of p is that through equation (6.67) we can obtain from 
it the expectation value of any observable. As the system evolves, these ob¬ 
servables will evolve because p evolves. To find its equation of motion, we 
differentiate equation (6.64) with respect to time and use the tdse. The 
differentiation is straightforward because p n is time-independent: if the sys¬ 
tem was in the state | n) at time t, at any later time it will certainly be in 
whatever state | n) evolves into. Hence we have 


dp 

dt 


(®\ n ) /I i . d(n\ \ 

n x ' 

• \j2 Pn ( H \ n )( n \ - \ n )( n \ H ) = 7 l -(Hp- P H ). 
n 


This equation of motion can be written more simply 




(6.69) 


(6.70) 


To obtain the equation of motion of an arbitrary expectation value Q = 
Tr(pQ), we expand the trace in terms of a time-independent basis {|a)} and 
use equation (6.70): 

= = P ~ P H )Q\ a ) = Tr (p[<3,#]), (6-71) 


where the last equality uses equation (3) of Box 6.3. Ehrenfest’s theorem 
(2.34) states that the rate of change of the expectation value Q for a given 
quantum state is the expectation value of [Q,H] divided by i h, so equation 
(6.71) states that when the quantum state is uncertain, the expected rate of 
change of Q is the appropriately weighted average of the rates of change of 
Q for each of the possible states of the system. 

Notice that the density operator and the operators for the Hamiltonian 
and other observables encapsulate a complete, self-contained theory of dy¬ 
namics. If we have incomplete knowledge of our system’s initial state, use 
of this theory is mandatory. If we do know the initial state, we can still use 
this apparatus by assigning our system the density operator 

P = IV’XV’I (6-72) 

rather than using the tdse and extracting amplitudes for possible outcomes 
of measurements. However, when p takes the special form (6.72), the use of 
the density operator becomes optional (Problem 6.8). 
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6.3.1 Reduced density operators 

We have seen that any physical interaction between two quantum systems 
is likely to entangle them. No man is an island and no system is truly 
isolated (except perhaps the entire Universe!) Consequently, a real system 
is constantly entangling itself with its environment. We now show that even 
if our system starts in a pure state, once it has entangled itself with its 
environment, it will be in an impure state. 

We consider a system that is comprised of two subsystems: A, which 
will represent our system, and B, which will represent the environment - 
the environment consists of anything that is dynamically coupled to our 
system but not observed in sufficient detail for its dynamics to be followed. 
Let the density operator of the entire system be 


pab = \ A 'i i )\ B '’i)Pw{A;k\(B-,l\. 

ijkl 


(6.73) 


Let Q be an observable property of subsystem A. The expectation value of 
Q is 

Q = Tr Qp 

= ^(A;rn|(B;n|Q | |A; i)|B; j)p ijk i{A- ft|(B; l\ J |A;m)|B;n) 

ran \ijkl j 

— ^ ' (A, 771 | Q |A, i) ^ ' Pinmni 
mi n 

(6.74) 

where the second equality exploits the fact that Q operates only on the 
states of subsystem A, and also uses the orthonormality of the states of each 
subsystem: (A;ft|A;m) = 5km , etc. We now define the reduced density 
operator of subsystem A to be 


PA 


= X^ B ; n l^ AB l B ! n ) = l A; *) ( ^Pinkn J (A; ft|, 


(6.75) 


where the second equality uses equation (6.73). In terms of the reduced 
density operator, equation (6.74) can be written 


Q = ^(A; to|Q pa |A; m) = TtQpa- (6.76) 

m 


Thus the reduced density operator enables us to obtain expectation values of 
subsystem A’s observables without bothering about the states of subsystem 
B. It is formed from the density operator of the entire system by taking the 
partial trace over the states of subsystem B (eq. 6.75). 

Suppose both subsystems start in well-defined states. Then under the 
tdse the composite system will evolve through a series of pure states 
and at time t, the density operator of the composite system will be (cf. 6.72) 

pAB = \ip,t)(ip,t\. (6.77) 

If the two subsystems have not become entangled, so \ip,t) = |A, t)|B,t), 
then the reduced density operator for A is 

p A = | A, t)( A, t\ ]T(B; i|B, t)( B, i|B; i) = |A, t)( A, t\, (6.78) 


where we have used the fact that the set {|B; i)} is a complete set of states for 
subsystem B. Equation (6.78) shows that so long as the subsystems remain 
unentangled, the reduced density operator for A has the form expected for 
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a system that is in a pure state. To show that entanglement will generally 
lead subsystem A into an impure state, we consider the simplest non-trivial 
example: that in which both subsystems are qubits. Suppose they have 
evolved into the entangled state 

M ^ (|A;0)|B;0) + |A;1)|B; 1)) (6.79) 

Then evaluating the trace over the two states of B we find 

p A = i(B; 0| (|A; 0)|B; 0) + |A; 1)|B; 1)) ((A; 0|(B; 0| + (A; 1|(B; 1|) |B; 0) 

+ i(B; 1| (|A; 0)|B; 0) + |A; 1)|B; 1)) ((A; 0|(B; 0| + (A; 1|(B; 1|) |B; 1) 

= 5 (IA; 0)(A; 0| + |A; 1)(A; 1|), 

(6.80) 

which is the density operator of a very impure state. Physically this result 
makes perfect sense: in equation (6.80) p\ states that subsystem A has 
equal probability of being in either | 0 ) or 11 ), which is consistent with the 
state (6.79) of the entire system. In that state these two possibilities were 
associated with distinct predictions about the state of subsystem B, but 
in passing from pab to p\ we have lost track of these correlations: if we 
choose to consider system A in isolation, we lose the information carried 
by these correlations, with the result that we have incomplete information 
about system A. In this case system A is in an impure state. So long as we 
recognise that A is part of the larger system AB and we retain the ability 
to measure both parts of AB, we have complete information, so AB is in a 
pure state. 

In this example system A represents the system under study and system 
B represents the environment of A, which we defined to be whatever is dy¬ 
namically coupled to A but incompletely instrumented. If, for example, A is 
a hydrogen atom, then the electromagnetic held inside the vessel containing 
the atom would form part of B because a hydrogen atom, being comprised 
of two moving charged particles, is inevitably coupled to the electromagnetic 
Held. If we start with the atom in its first excited state and the electro¬ 
magnetic Held in its ground state, then atom, field and atom-plus-field are 
initially all in pure states. After some time the atom-plus-field will evolve 
into the state 


IVt t) = oo(t)|A; 0 )|F; 1 ) + ai (i)|A; 1 )|F; 0), (6.81) 

where |A;n) is the n th excited state of the atom, while |F;n) is the state of 
the electromagnetic field when it contains n photons of the frequency asso¬ 
ciated with transitions between the atom’s ground and first-excited states. 
In equation (6.81), ao(t) is the amplitude that the atom has decayed to its 
ground state while a\(t) is the amplitude that it is still in its excited state. 
When neither amplitude vanishes, the atom is entangled with the electro¬ 
magnetic field. If we fail to monitor the electromagnetic field, we have to 
describe the atom by its reduced density operator 

PA = M 2 |A; 0)(A; 0| + M 2 |A; 1)(A; 1|. (6.82) 

This density operator indicates that the atom is now in an impure state. 

In practice a system under study will sooner or later become entangled 
with its environment, and once it has, we will be obliged to treat the system 
as one for which we lack complete information. That is, we will have to 
predict the results of measurements with a non-trivial density operator. The 
transition of systems in this way from pure states to impure ones is called 
quantum decoherence. Experimental work directed at realising the possi¬ 
bilities offered by quantum computing is very much concerned with arresting 
the decoherence process by weakening all couplings to the environment. 
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6.3.2 Shannon entropy 

Once we recognise that systems are typically in impure states, it’s natural 
to want to quantify the impurity of a state: for example, if in the definition 
(6.64) of the density operator, p 3 = 0.99999999, then the system is almost 
certain to be found in the state |3) and predictions made by assuming that 
the system is in the pure state |3) will not be much in error, while if the 
largest probability occurring in the sum is ICC , the effects of impurity will 
be enormous. 

A probability distribution {p,} provides a certain amount of information 
about the outcome of some investigation. If one probability is close to unity, 
the information it provides is nearly complete. Conversely, if all the probabil¬ 
ities are small, no outcome is particularly likely and the missing information 
is large. The question we now address is “what is the appropriate measure 
of the missing information that remains after a probability distribution {pi} 
has been specified?” 

Logic dictates that the required measure s(pi,... ,p n ) of missing infor¬ 
mation must have the following properties: 

• s must be a continuous, symmetric function of the pp 

• s should be largest when every outcome is equally likely, i.e., when 
Pi = 1/n for all i. We define 

=S « ( 6 ' 83 ) 

and require that s„+i > s n (more possibilities implies more missing 
information). 

• s shall be consistent in the sense that it yields the same missing informa¬ 
tion when there are different ways of enumerating the possible outcomes 
of the event. 

To grasp the essence of the last requirement, consider an experiment with 
three possible outcomes x\ , X 2 and x 3 to which we assign probabilities p± , P 2 
and P 3 , yielding missing information s(p\ ,P 2 , Pa)- We could group the last 
two outcomes together into the outcome £ 23 , by which we mean “either X 2 
or 2 : 3 ”. Then we assign a probability P 23 = P 2 + P 3 to getting £ 23 , giving 
missing information s(pi,P 23 )- To this missing information we have to add 
that associated with resolving the outcome X 23 into either X 2 or x 3 . The 
probability that we will have to resolve this missing information is P 23 , and 
the probability of getting X 2 given that we have £23 is P 2 /P 23 , so we argue 
that 

s{Pl,P 2 ,P 3 ) = s(pi,P 23 ) +P 23 S[—, — ) . (6.84) 

\p23 P23> 

This equation is readily generalised: we have n possible outcomes Xi,... ,x n 
with probabilities pi,... ,p n . We gather the outcomes into r groups and let 
2 /i be the outcome in which one of xi ,..., was obtained, 2/2 the outcome in 
which one of x^+i • • •, Xk 2 was obtained etc, and let denote the probability 
of the outcome yi. Then since the probability that we get x\ given that we 
have already obtained 2/1 is pi/w\, we have 


s(pi ,..., Pn) = s(wi, ...,w r ) + w 1 s(pi/wi,.. ■,Pk 1 /w 1 )+ 

- \-w r s{p n - kr /w r , . . . , Pn/Wr)- 


(6.85) 


Since s is a continuous function of its arguments, it suffices to evaluate 
it for rational values of the arguments. So we assume that there are integers 
ni such that pi = rii/N , where JT rij = N by the requirement that the 
probabilities sum to unity. Consider a system in which there are N equally 
likely outcomes, and from these form n groups, with n, possibilities in the 
i th group. Then the probability of the group is pi and the probability of 
getting any possibility in the i th group given that the z th group has come up, 
is 1/rij. Hence applying equation (6.85) to the whole system we find 


n 

s(l/N,..., 1/N) = s(pi, ... ,p n ) + y^pis(l/ni,..., 1/rii) (6.86) 

i 
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Box 6.4: Solving Equation (6.88) 

Let s(n) = s n . Then equation ( 6 . 88 ) is easily extended to 
s(mnr ■ ■ ■) = s(n) + s(m) + s(r) + • • •, 
so with n = m = r = ■ ■ • we conclude that 

s(n k ) = ks(n). 

Now let u, v be any two integers bigger than 1. Then for arbitrarily large 
n we can find m such that 


to In v to + 1 

n ~ Inn n 


u m < v n < u n 


Since s is monotone increasing, 

s(u m ) < s(v n ) < s(u m+1 ) => ms(u) < ns(v) < (in + l)s(u) 

to s(v) to +1 

=> — < -7-r <-• 

n s(u) n 

Comparing equation (1) with equation (2), we see that 


s(v) Inn 
s(u) lnu 


s(v) s(u) 
In v In u 


where e = s(u)/(n Inn) is arbitrary small. Thus we have shown that 
s(n) oc Inn. 


or with the definition (6.83) of s n , 

n n 

s(pi,- ..,Pn) = s N ~ Y,p iSni (tf = 5». (6.87) 

i i 

This equation relates s evaluated on a general argument list to the values 
that s takes when all its arguments are equal. Setting all the rii = m we 
obtain a relation that involves only s n : 

S n — Snm S m . (6.88) 

It is easy to check that this functional equation is solved by s n = A'Inn, 
where K is an arbitrary constant that we can set to unity. In fact, in Box 6.4 
it is shown that this is the only monotone solution of equation ( 6 . 88 ). Hence 
from equation (6.87) we have that the unique measure of missing information 
is 

n 

s(pi , • • ■ ,Pn) = In IV - y. Pi In m 

i 

= -J2p< lnpi ■ 

i 

Since every probability p t is non-negative and less than or equal to one, s is 
inherently positive. Claude Shannon (1916-2001) first demonstrated 11 that 
the function (6.89) is the only consistent measure of missing information. 
Since s(p) turns out to be intimately connected to thermodynamic entropy, 
it is called the Shannon entropy of the probability distribution. 

The Shannon entropy of a density operator p is defined to be 

s(p) = — Trplnp. (6.90) 

11 C.E. Shannon, Bell Systems Technical Journal, 27, 379 (1948). For a much fuller 
account, see E.T. Jaynes Probability Theory: the Logic of Science Cambridge University 
Press, 2003. 
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The right side of this expression involves a function, ln(:r) of the operator 
p. We recall from equation (2.20) that f(p) has the same eigenkets as p and 
eigenvalues /(A,), where \ are the eigenvalues of p. Hence 

s = -Tr(plnp) = - '^2(n\'^2p i \i)(i\'^2ln(p j )\j)(j\n) = -^p„lnp n . 

n i j n 

(6.91) 

Hence s is simply the Shannon entropy of the probability distribution {pi} 
that appears in the definition (6.64) of p. 


6.4 Thermodynamics 

Thermodynamics is concerned with macroscopic systems about which we 
don’t know very much, certainly vastly less than is required to define a 
quantum state. For example, the system might consist of a cylinder full 
of fluid and our knowledge be confined to the chemical nature of the fluid 
(that it is O 2 or CO 2 , or whatever), the mass of fluid, its volume and the 
temperature of the environment with which it is in equilibrium. In the 
canonical picture we consider that as a result of exchanges of energy with 
the environment, the energy of the fluid fluctuates around a mean U. The 
pressure also fluctuates around a mean value P, but the volume V is well- 
defined and under our control. 

Thermodynamics applies to systems that are more complex than bodies 
of fluid, for example to a quantity of diamond. In such a case the stress in the 
material is not fully described by the pressure, and thermodynamic relations 
involve also the shear stress and the shear strain within the crystal. If the 
crystal, like quartz, has interesting electrical properties, the thermodynamic 
relations will involve the electric field within the material and the polarisation 
that it induces. A fluid is the simplest non-trivial thermodynamic system and 
therefore the focus of introductory texts, but the principles that it illuminates 
are of much wider validity. For simplicity we restrict our discussion to fluids. 

To obtain relations between the thermodynamic variables from a knowl¬ 
edge of the system’s microstructure, we need to assign a probability pi to 
each of the system’s zillions of quantum states. We argue that the only ratio¬ 
nal way to assign probabilities to the stationary states of a thermodynamic 
system is to choose them such that (i) they reproduce any measurements 
we have of the system, and (ii) they maximise the Shannon entropy. Re¬ 
quirement (ii) follows because in choosing the {pi} we must not specify any 
information beyond that included when we satisfy requirement (i) - our prob¬ 
abilities must “tell the truth, the whole truth and nothing but the truth”. 
It is straightforward to show (Problem 6.16) that the pi that maximise the 
Shannon entropy for given internal energy 

U = Y, E *Pi ( 6 ' 92 ) 

stationary 
states i 

are given by 

Pi = (6.93a) 

where j3 = l/ik^T) is the inverse temperature and 

Y e ~ 0Ei - (6.93b) 

stationary 
states i 


The quantity Z defined above is called the partition function; it is man¬ 
ifestly a function of T and less obviously a function of the volume V and 
whatever other parameters define the spectrum {Pi} of the Hamiltonian. In 
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equation (6.93a) its role is clearly to ensure that the probabilities satisfy the 
normalisation condition JTp.j = 1- 

Since the probability distribution (6.93a) maximises the Shannon en¬ 
tropy for given internal energy, we take the density operator of a thermody¬ 
namic system to be diagonal in the energy representation and to be given 

by 

P =\ (6-94) 

stationary 
states i 

This form of the density operator is called the Gibbs distribution in honour 
of J.W. Gibbs (1839-1903), who died before quantum mechanics emerged but 
had already established that probabilities should given by equation (6.93a). 

The sum in equation (6.94) is over quantum states not energy levels. It 
is likely that many energy levels will be highly degenerate and in this case 
the sum simplifies to Z = y(„ g a e~^ Ea , where a runs over energy levels and 
g a is the number of linearly independent quantum states in level a. 

The expectation of the Hamiltonian of a thermodynamic system is 


H = Tr (Hp) = ^2(n\H^2pi\i)(i\n) = ^ p n E n = U, (6.95) 

n i n 

where we have used the definition (6.92) of the internal energy. Thus the 
internal energy U of thermodynamics is simply the expectation value of the 
system’s Hamiltonian. Another important expression for U follows straight¬ 
forwardly from equations (6.92) and (6.93): 


d In Z 






dp 


(6.96) 


We obtain an interesting equation using equation (6.93a) to eliminate 
the second occurrence of p n from the extreme right of equation (6.91): 


s = - Y, p n (-pE n - In Z) = PU + In Z. 

n 

In terms of the thermodynamic entropy 


S = kss 


and the Helmholtz free energy 

F = —knT In Z 

equation (6.97) can be written 


F = U~TS, 


(6.97) 


(6.98) 


(6.99) 


( 6 . 100 ) 


which in classical thermodynamics is considered to be the definition of the 
Helmholtz free energy. When we substitute our definition of F into equation 
(6.96), we obtain 


d(pF) 

dp 


dF dF 

F + P — = F-T —. 
P dp dT 


( 6 . 101 ) 


Comparing this equation with equation (6.100) we conclude that 


S=- 


dF 

dT' 


( 6 . 102 ) 


The difference of equation (6.92) between two similar thermodynamic 
states is 

dU = ^2{dp i E i +p i dE i ). (6.103) 
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Similarly differencing the definition S = —&B P* °f the thermody¬ 

namic entropy (eqns 6.91 and 6.98), we obtain 

(IS = -fc B '^2 ( ln Pi + 1 ) = -&B In k dp,, (6.104) 

i i 


where the second equality exploits the fact that {p,;} is a probability distri¬ 
bution so J^iPi = 1 always. By equation (6.93a), In pi = —Ei/(kBT) — In Z, 
so again using J] j; p, = 1, equation (6.104) can be rewritten 

TAS=Y^,Eidpi. (6.105) 

i 

If we heat our system up at constant volume, the Ei stay the same but the 
Pi change because they depend on T. In these circumstances the increase in 
internal energy, JT i?,dp,, is the heat absorbed by the system. Consequently, 
equation (6.105) states that TdS is the heat absorbed when the system is 
heated with no work done. This statement coincides with the definition of 
entropy in classical thermodynamics. 

Substituting equation (6.105) into equation (6.103) yields 

dU = TdS — PdV, (6.106a) 

where 

P=-^Pi^. (6.106b) 

i 

If we isolate our system from heat sources and then slowly change its vol¬ 
ume, the adiabatic principle (§11.1) tells us that the system will stay in 
whatever stationary state it started in. That is, the p,; will be constant while 
the volume of the thermally isolated system is slowly changed. In classical 
thermodynamics this is an ‘adiabatic’ change. From equation (6.104) we see 
that the entropy S is constant during an adiabatic change, just as classical 
thermodynamics teaches. 

Since dS = 0 in an adiabatic change, the change in U as V is varied 
must be the mechanical work done on the system, —PdV, where P is the 
pressure the system exerts. This argument establishes that the quantity P 
defined by (6.106b) is the pressure. 

Differentiating equation (6.100) for the Helmholtz free energy and using 
equation (6.106a) to eliminate dC7, we find that 


dP = -SdT — PdV. (6.107) 

From this it immediately follows that 



The first of these equations was obtained above but the second one is new. 

Equation (6.106a) is the central equation of thermodynamics since it 
embodies both the first and second laws of thermodynamics. This result es¬ 
tablishes that classical thermodynamics is a consequence of applying quan¬ 
tum mechanics to systems of which we know very little. Remarkably, physi¬ 
cists working in the first half of the 19 th century discovered thermodynamics 
long before quantum mechanics was thought of, using extremely subtle argu¬ 
ments concerning heat engines. Quantum mechanics makes these arguments 
redundant. Notwithstanding this redundancy, they continue to feature in un¬ 
dergraduate syllabuses the world over because they are beautiful. But then 
so are copperplate writing and slide rules, which have rightly disappeared 
from schools. 
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A possible explanation for the survival of thermodynamics as an in¬ 
dependent discipline is as follows. Equations (6.99), (6.100) and (6.108) 
establish that any thermodynamic quantity can be obtained from the depen¬ 
dence of the partition function on T and V. Unfortunately, this dependence 
can be calculated for only a very few Hamiltonians. In almost all practical 
cases we cannot proceed by evaluating Z. However, once we know that Z 
and therefore F and S exist , we can determine their functional forms from 
experimental data. For example, by measuring the heat released on cooling 
our system at constant volume to absolute zero, we can determine its entropy 
S = f dQ/T. Similarly, we can measure the system’s pressure as a function 
of T and V. Then by integrating equation (6.107) we can obtain F(T,V) 
and thus infer Z(T, V). In none of these operations is the involvement of 
quantum mechanics apparent, so engineers and chemists, who make exten¬ 
sive use of thermodynamics, are generally unaware that it is a consequence of 
quantum mechanics. Quantum mechanics provides us with relations between 
thermodynamics quantities but does not enable us to evaluate the quantities 
themselves. Evaluation must still be done with 19 th century technology. 

Although thermodynamics systems are inherently macroscopic, quan¬ 
tum mechanics plays a central role in determining their thermodynamic 
quantities because it defines the stationary states we have to sum over in 
(6.93b) to form the partition function. Before quantum mechanics was born, 
the thermodynamic properties of an ideal gas one composed of molecules 
that occupy negligible volume and interact only at very short range - were 
obtained by summing over the phase-space locations of each molecule inde¬ 
pendently. In this procedure there are six distinct states of a three-molecule 
gas in which there are molecules at the phase-space locations Xi, X 2 and X 3 : 
in one state molecule 1 is at xi, molecule 2 is at X 2 and molecule 3 is at 
x 3 , and a distinct state is obtained by swapping the locations of molecule 
1 and molecule 2, and so forth. Quantum mechanics teaches that the state 
of the gas is completely specified by listing the three occupied states, | 1 ), 

12) and |3) for it is meaningless to say which molecule is in which state. 
The classical way of counting states leads to absurd results even for gases at 
room temperature (Problem 6.22). At low-temperatures another aspect of 
classical physics leads to erroneous results: the low-lying energy levels of a 
gas are distributed discretely rather than continuously in E, with the result 
that specific heats always vanish in the limit T —> 0 (Nernst’s theorem; 
Problem 6.23), contrary to the prediction of classical physics. 

An important lesson to be learnt from the failure of classical physics to 
predict the properties of an ideal gas is the importance in quantum mechan¬ 
ics of thinking wholistically: we have to sum over the quantum states of the 
whole cylinder of gas, not over the states of individual molecules. This is 
analogous to the importance for understanding EPR phenomena of consid¬ 
ering the quantum system formed by the entangled particles taken together. 
In quantum mechanics the whole is generally very much more than the sum 
of its parts because there are non-trivial correlations between the parts . 12 


6.5 Measurement 

In §1.4 we asserted that the state of a system ‘collapses’ into one of the 
eigenstates | qfi) of the operator Q the instant we measure the observable Q. 
Consequently, the result of measuring Q is to leave the system in the well- 
defined quantum state \qj). It’s time to examine this collapse hypothesis 
critically. 

Superficially the collapse hypothesis is merely an assertion that mea¬ 
surements are reproducible in the sense that if we measure something twice 
in quick succession, we will obtain the same result: in § 2.1 | qj) was defined 

12 The origin of these correlations is the subject of §10.1. 
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to be the state in which measurement of Q was certain to yield the value 
qj , so if the measurement of Q is to be reproducible, the system has to be 
in the state \qj) immediately after the measurement. However, our system’s 
quantum state | ip) is supposed to describe the system’s real, physical state, 
not just our knowledge of it. So something physical must have happened to 
make \i/j) shift from the value it had before the measurement to the state 
| qj) ^ \ip) that it had just after the measurement was completed. Notice that 
the evolution from |?/>} to | qj) has not been derived from the tdse, which we 
have stated to be the equation that governs the time-evolution of | i/j). So 
this Copenhagen interpretation of quantum mechanics implies that every 
measurement leads to a momentary suspension of the equations of motion, 
so the system can be steered, by forces unspecified, into a randomly chosen 
state! This is not serious physics. We need to consider more realistically 
what is involved in making a measurement. 

A first step from Copenhagen towards the real world can be taken by 
recognising that since real measurements are associated with error bars, they 
will not leave the system in a state in which the result of a subsequent 
measurement is certain. It follows that a real measurement of Q will in 
general not leave the system in one of the states | qt) in which the result of a 
subsequent measurement is certain. That is, the collapse hypothesis is false. 

The Copenhagen interpretation does, however, contain a crucial insight 
into measurement by stressing that any measurement physically disturbs the 
system, so the system’s state after a measurement has been made is different 
from what it was earlier. In classical physics we may or may not have to 
worry about the disturbance of the system by the measuring process - for 
example, when we measure the positions of Jupiter’s moons by pointing a 
telescope at them, we don’t need to worry about disturbance caused by mea¬ 
surement. But when we measure the voltage across a resistor by connecting 
a galvanometer in parallel with it, we change the voltage by increasing the 
current through the circuit either side of the resistor. We minimise this dis¬ 
turbance by buying a galvanometer with the highest affordable impedance, 
and we estimate the magnitude of the effect and try to correct for it. When 
measurements are made on systems small enough for quantum mechanics 
to be relevant, the system will be significantly disturbed because we cannot 
make instruments of arbitrary sensitivity - quantum mechanics itself makes 
this impossible. So the Copenhagen interpretation is right to stress that 
post- and pre-measurement states are significantly different. 

Where the Copenhagen interpretation slips up is in supposing that the 
disturbance caused by a measurement can be taken into account without 
knowing anything about the measuring instrument that was used. Physically 
it is obvious that since the disturbance is caused by the instrument, we 
cannot hope to predict the evolution of the system without knowledge of the 
physical principles on which the instrument works, and the configuration it 
was in when the measuring process started. In fact, it’s astonishing that 
useful predictions can be extracted from a theory that fails to engage with 
these key questions! 

The Copenhagen interpretation makes progress through two stratagems. 
First it assumes that the builder of the measuring instrument has been clever 
enough to make an instrument that makes essentially reproducible measure¬ 
ments. This being so, the state in which the measuring instrument leaves 
the system must be one of the eigenstates of the observable’s operator. Fo¬ 
cusing on instruments that measure reproducibly is a shrewd move because 
instruments that do not yield reproducible readings are regarded as ‘noisy’ 
and tend not to be used. So the Copenhagen interpretation does make an 
assumption about the nature of the measuring instrument - that it is a good 
one, so it steers the system’s state to one of the |- and gets by without 
considering the detailed physics that actually does the steering. By declin¬ 
ing to consider the physics of the instrument, the theory remains general and 
able to produce results that apply to any instrument rather than a particular 
brand of electrometer, or whatever. 
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The second stratagem is to abandon causality and assert that the out¬ 
come of a measurement is inherently uncertain. It merely supplies probabili¬ 
ties of the various measurement outcomes. So while a differential equation is 
supplied with which to calculate with precision the evolution of the system’s 
state between measurements, the consequences of measurement are left to 
blind chance. This stratagem circumvents the failure to consider fully the 
nature of the measuring equipment, for (as we shall argue) it is the unknown 
state of this equipment at the start of the measuring process that makes the 
outcome of the measurement uncertain. 

A key insight of the Copenhagen interpretation is that the states in 
which the outcome of measuring one observable is certain are generally dif¬ 
ferent from the states in which the outcome of measuring a different observ¬ 
able is certain. That is, there are fundamentally incompatible observables 
in the sense that if you are certain what value you will measure for one ob¬ 
servable, you cannot be certain what value you will measure for the other 
observable. Since the states in which the outcomes of measurements are 
the eigenkets of an observable’s operator, and operators that do not have a 
complete set of mutual eigenkets do not commute, incompatible observables 
have non-commuting operators. There is nothing deeply mysterious about 
incompatible observables - it just happens that the act of measuring one 
observable drives the system into different states from those into which the 
system is driven by measuring the other observable. The key thing is to be 
clear that an observable is not an intrinsic property of the system, but a 
question we can ask of it. In general my particle has neither a position nor 
a momentum, but these are questions I can ask of it, and after the question 
has been asked, the particle will be (temporarily) in a special state that does 
have a well-defined position / momentum. 

The probabilistic outcome of a measurement introduces to physics a 
new feature of great consequence: irreversibility. After a measurement, it is 
impossible to determine what the state of the system was before the mea¬ 
surement was made. This is so because many different initial states of the 
system are consistent with measuring a particular value of the observable Q, 
and therefore causing the system to finish in a given state \qi). 

An instrument is itself a dynamical system, and its dynamics is governed 
by quantum mechanics. We make a measurement by putting the instrument 
‘into contact’ with our system - that is, we ensure that the instrument and 
the system are dynamically coupled by a non-negligible Hamiltonian. Once 
in contact, the instrument and our system together form a composite system, 
and, like all dynamically coupled subsystems, they soon become entangled. 
That is, the state of the instrument becomes correlated with that of the 
system. It is as a consequence of this entanglement that the instrument is 
able to show a reading that reflects the state of the system being measured. 

The instrument must be sufficiently macroscopic that it can be read 
by a human being - if it were microscopic, an instrument would be needed 
to measure it, and so on until eventually the macroscopic scale is reached. 
That is, an instrument that is not macroscopic can be considered part of the 
quantum system being studied and evolved with the tdse; if a measurement 
is to be made, at some point the entire quantum system has to interact 
with a macroscopic instrument. Anything macroscopic will be in an impure 
state (§6.3). Consequently, once interaction with a macroscopic instrument 
is established, the outcome of the experiment will be probabilistic, just as 
the Copenhagen interpretation asserts. 

A measurement will also be irreversible in the sense that one cannot 
compute the state that the system was in prior to the interaction because 
such a computation would require for the initial conditions complete knowl¬ 
edge of the instrument’s state after the measurement. 

This discussion shows that the collapse hypothesis is really a clever way 
to circumvent our unwillingness to follow the dynamics of system/instrument 
interaction. The failure to follow the interaction enables the theory to make 
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general statements that are valid irrespective of which devices are actually 
used for measurement, but in specific cases it should be possible to obtain 
a more complete understanding by properly considering the dynamics of 
the measuring instrument. Unfortunately, we probably need an extension 
to quantum mechanics to take this step, because a conventional quantum- 
mechanical theory of the measuring instrument will require us at some point 
to ‘observe’ the instrument using the collapse hypothesis, from which we are 
trying to escape: quantum mechanics is a theoretical arena from which the 
only exit to the real world is through the turnstile of the collapse hypothesis. 

We expect any extension of quantum mechanics that successfully in¬ 
cludes the act of measurement to be formulated in terms of density opera¬ 
tors, because incomplete knowledge of the state of our instruments certainly 
makes a major contribution to the uncertain outcome of measurements, and 
may be entirely responsible for it. 

Problems 

6.1 A system AB consists of two non-interacting parts A and B. The dy¬ 
namical state of A is described by |a), and that of B by |6), so |a) satisfies the 
tdse for A and similarly for | b). What is the ket describing the dynamical 
state of AB? In terms of the Hamiltonians Ha and Hb of the subsystems, 
write down the tdse for the evolution of this ket and show that it is au¬ 
tomatically satisfied. Do Ha and Hb commute? How is the tdse changed 
when the subsystems are coupled by a small dynamical interaction //j nt ? 
If A and B are harmonic oscillators, write down Ha, Hb- The oscillating 
particles are connected by a weak spring. Write down the appropriate form 
of the interaction Hamiltonian H; nt . Does Ha commute with Hint? Explain 
the physical significance of your answer. 

6.2 Explain what is implied by the statement that “the physical state of 
system A is correlated with the state of system B.” Illustrate your answer 
by considering the momenta of cars on (i) London’s circular motorway (the 
M25) at rush-hour, and (ii) the road over the Nullarbor Plain in southern 
Australia in the dead of night. 

Explain why the states of A and B must be uncorrelated if it is possible 
to write the state of AB as a ket |AB; -0) = |A; t/>i)|B; ip 2 ) that is a product 
of states of A and B. Given a complete set of states for A, {|A;z}} and a 
corresponding complete set of states for B, {|B; z)}, write down an expression 
for a state of AB in which B is possibly correlated with A. 

6.3 Given that the state |AB) of a compound system can be written as 
a product |A)|B) of states of the individual systems, show that when |AB) 
is written as JT - Cjj|A; z)|B; j) in terms of arbitrary basis vectors for the 
subsystems, every column of the matrix c,y is a multiple of the leftmost 
column. 

6.4 Consider a system of two particles of mass m that each move in one 
dimension along a given rod. Let |1; x) be the state of the first particle when 
it’s at x and |2; y) be the state of the second particle when it’s at y. A 
complete set of states of the pair of particles is {| xy)} = {|1; x)|2 ;y)}- Write 
down the Hamiltonian of this system given that the particles attract one 
another with a force that’s equal to C times their separation. 

Suppose the particles experience an additional potential 

V{x,y) = \C(x + y) 2 - (6.109) 

Show that the dynamics of the two particles is now identical with the dynam¬ 
ics of a single particle that moves in two dimensions in a particular potential 
4 > (x,2 /), and give the form of <b. 
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6.5 In §6.1.4 we derived Bell’s inequality by considering measurements by 
Alice and Bob on an entangled electron-positron pair. Bob measures the 
component of spin along an axis that is inclined by angle 6 to that used by 
Alice. Given the expression 

| —, b) = cos( 0 / 2 ) e irl> ^ 2 \~) — sin( 0 / 2 ) e -I< ^ 2 |+), ( 6 . 110 ) 

for the state of a spin-half particle in which it has spin — ^ along the direction 
b with polar angles (9, 4>), with |±) the states in which there is spin along 
the z-axis, calculate the amplitude Ab(—|A+) that Bob finds the positron’s 
spin to be — ^ given that Alice has found +1 for the electron’s spin. Hence 
show that Pb{— |A+) = cos 2 ( 0 / 2 ). 

6.6 Show that when the Hadamard operator Ua is applied to every qubit 
of an n-qubit register that is initially in a member | m) of the computational 
basis, the resulting state is 

1 2 n — l 

W) = a *\ x )' ( 6 . 111 ) 

"" x=0 

where a x = 1 for all x if to = 0 , but exactly half the a x = 1 and the other 
half the a x = — 1 for any other choice of m. Hence show that 

1 jj | . _f | 0 ) if all a x = 1 

2 n /2 H 2-~/ ax ' x ' | j to) ^ | 0 ) if half the a x = 1 and the other a x = — 1 . 

( 6 . 112 ) 

6.7 Show that the trace of every Hermitian operator is real. 

6.8 Let p be the density operator of a two-state system. Explain why p 
can be assumed to have the matrix representation 


P = 




(6.113) 


where a and b are real numbers. Let Eq and Ei > Eq be the eigenenergies of 
this system and |0) and |1) the corresponding stationary states. Show from 
the equation of motion of p that in the energy representation a and b are 
time-independent while c(t) = c(0)e lajt with u> = (Ex — E 0 )/h. 

Determine the values of a, b and c(t) for the case that initially the system 
is in the state 1 4>) = (|0) +11 ))/\/2. Given that the parities of |0) and |1) are 
even and odd respectively, find the time evolution of the expectation value 
x in terms of the matrix element (0|x|l). Interpret your result physically. 

6.9 In this problem we consider an alternative interpretation of the density 
operator. Any quantum state can be expanded in the energy basis as 

N 

hfc0> = X^e*“| „), (6.114) 

n=1 


where <j) n is real and p n is the probability that a measurement of energy will 
return E n . Suppose we know the values of the p n but not the values of the 
phases <j) n . Then the density operator is 

r 27r i 

p= pTwl^' (6.115) 

Jo 

Show that this expression reduces to ^2 n Pn\n)(n\. Contrast the physical 
assumptions made in this derivation of p with those made in §6.3. 

Clearly 1 1 [>; (f>) can be expanded in some other basis (|< 7 r)} as 

\^,ct>) = J2^e^\qr), (6.116) 

r 

where P r is the probability of obtaining q r on a measurement of the observ¬ 
able Q and the ?? r (</ ) ) are unknown phases. Why does this second expansion 
not lead to the erroneous conclusion that p is necessarily diagonal in the 
(|g r )} representation? 
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6.10 Show that the equation of motion of the density operator p is solved 

by 

p t =U(t)p 0 U\t), (6.117) 

where U(t) = e~ lHt / h is the time-evolution operator introduced in §4.3. 

6.11* Show that when the density operator takes the form p = 
the expression Q = Tr Qp for the expectation value of an observable can be 
reduced to {4>\Q\ip). Explain the physical significance of this result. For the 
given form of the density operator, show that the equation of motion of p 
yields 

10)(-01 = \^){<t>\ where \</>) = - H\ip). (6.118) 

Show from this equation that \<j>) = a\ip), where a is real. Hence determine 
the time evolution of | ip) given the at t = 0 \ip) = | E) is an eigenket of H. 
Explain why p does not depend on the phase of \ip) and relate this fact to 
the presence of a in your solution for 

6.12 The density operator is defined to be p = p a |a)(a|, where p a 

is the probability that the system is in the state a. Given an arbitrary 
basis (I*)} and the expansions |a) = JT a m\i ), calculate the matrix elements 
pij = (i\p\j) of p. Show that the diagonal elements pn are non-negative real 
numbers and interpret them as probabilities. 

6.13 Consider the density operator p = JA ■ pij\i){j\ of a system that is in 
a pure state. Show that every row of the matrix p^ is a multiple of the first 
row and every column is a multiple of the first column. Given that these 
relations between the rows and columns of a density matrix hold, show that 
the system is in a pure state. Hint: exploit the real, non-negativity of pn 
established in Problem 6.12 and the Hermiticity of p. 


6.14 Consider the rate of change of the expectation of the observable Q 
when the system is in an impure state. This is 


dQ 

dt 


= J2 Pn ^ n \Q l n ) 


(6.119) 


where p n is the probability that the system is in the state | n). By using 
Ehrenfest’s theorem to evaluate the derivative on the right of (6.119), derive 
the equation of motion ifidQ/dt = Tr (p[Q,H]). 

6.15 Find the probability distribution (p l5 ... ,p n ) for n possible outcomes 
that maximises the Shannon entropy. Hint: use a Lagrange multiplier. 


6.16 Use Lagrange multipliers A and /? to extremise the Shannon entropy 
of the probability distribution {pi} subject to the constraints (i) pi = 1 
and (ii) JApi-E) = U. Explain the physical significance of your result. 

6.17 Explain why if at t = 0 the density operator of a system is given by 
the Gibbs distribution, it remains so at later times. 


6.18 A composite system is formed from uncorrelated subsystem A and 
subsystem B, both in impure states. The numbers {pAi} are the probabilities 
of the members of the complete set of states {| A;«)} for subsystem A, while 
the numbers {pb?:} are the probabilities of the complete set of states (|B;i)} 
for subsystem B. Show that the Shannon entropy of the composite system is 
the sum of the Shannon entropies of its subsystems. What is the relevance 
of this result for thermodynamics? 


6.19 The |0) state of a qubit has energy 0, while the |1) state has energy e. 
Show that when the qubit is in thermodynamic equilibrium at temperature 
T = 1 /(Ab/ 3) the internal energy of the qubit is 


U = 


+ 1 ' 


( 6 . 120 ) 


Show that when /3e <C 1, U ~ |e, while for (3e 1, U ~ ee Interpret 

these results physically and sketch the specific heat C = dU/dT as a function 
of T. 
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6.20 Show that the time-evolution of the density operator leaves the Shan¬ 
non entropy s = — Tr p log p invariant. 

6.21 Show that the partition function of a harmonic oscillator of natural 
frequency w is 

e -0huj/2 

Zbo = i _ e -/3huj ■ (6.121) 

Hence show that when the oscillator is at temperature T = l/(kB0) the 
oscillator’s internal energy is 


t/ ho = hu 


1 

q(3Huj _ ^ 


( 6 . 122 ) 


Interpret the factor ( e^ hul — 1) 1 physically. Show that the specific heat 
C = dU/dT is 

e /3hu> 

C = fcB (e^-i ) 2 (/ 3 M 2 - (6-123) 

Show that liniT->.o C = 0 and obtain a simple expression for C when ksT 
hui. 

6.22 A classical ideal monatomic gas has internal energy U = §-ZV/cbT and 
pressure P = IV/cbT/V, where N is the number of molecules and V is the 
volume they occupy. From these relations, and assuming that the entropy 
vanishes at zero temperature and volume, show that in general the entropy 
is 

S(T,V) = Affc B (|lnT + lnV). (6.124) 

A removable wall divides a cylinder into equal parts of volume V. Initially 
the wall is in place and each half contains N molecules of ideal monatomic 
gas at temperature T. The wall is removed. Show that equation (6.124) 
implies that the entropy of the entire body of fluid increases by 2 In 2 Nkn. 
Can this result be squared with the principle that d<f> = d Q/T, where AQ is 
the heat absorbed when the change is made reversibly? What conclusion do 
you draw from this thought experiment? 

6.23 Consider a ‘gas’ formed by M non-interacting, monatomic molecules 
of mass m that move in a one-dimensional potential well V = 0 for |ar| < a 
and oo otherwise. Assume that at sufficiently low temperatures all molecules 
are either in the ground or first-excited states. Show that in this approxima¬ 
tion the partition function is given by 


In Z = -AtpEo + e~ 3/3Eo - e - 3(M+1)/3B ° where E 0 = 

8 ma 2 


(6.125) 


Show that for M large the internal energy, pressure and specific heat of this 
gas are given by 


U = E 0 (M + 3e~ 30Eo ) ; P = — (M + 3e~ 3,3Eo ) ; C v = ^%e~ 3,3Eo . 

a KbT z 

(6.126) 

In what respects do these results for a quantum ideal gas differ from the 
properties of a classical ideal gas? Explain these differences physically. 
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Angular Momentum 


In Chapter 4 we introduced the angular-momentum operators Ji as the gener¬ 
ators of rotations. We showed that they form a pseudo-vector, so J 2 = JA Jf 
is a scalar. By considering the effect of rotations on vectors and scalars, we 
showed that the the commute with all scalar operators, including J, and 
found that commutator of J; with a component of vector operator is given 
by equation (4.31). From this result we deduced that the Ji do not commute 
with one another, but satisfy [Ji, Jj\ = eijk.Jk- 

Although we have from the outset called the Ji ‘angular-momentum 
operators’, the only connection we have established between the Ji and an¬ 
gular momentum is tenuous and by no means justifies our terminology: we 
have simply shown that when the Hamiltonian is invariant under rotations 
about some axis a, and the system starts in an eigenstate of the correspond¬ 
ing angular-momentum operator a ■ J, it will subsequently remain in that 
eigenstate. Consequently, the corresponding eigenvalue is then a conserved 
quantity. In classical mechanics dynamical symmetry about some axis im¬ 
plies that the component of angular momentum about that axis is conserved, 
so it is plausible that the conserved eigenvalue is a measure of angular mo¬ 
mentum. This suggestion will be substantiated in this chapter. Another 
important task for the chapter is to explain how the orientation of a system 
is encoded in the amplitudes for it to be found in different eigenstates of 
appropriate angular-momentum operators. We start by using the angular- 
momentum commutation relations to determine the spectrum of the Ji. 


7.1 Eigenvalues of J z and J 2 

Since no two components of J commute, we cannot find a complete set of 
simultaneous eigenkets of two components of J. We can, however, find a 
complete set of mutual eigenkets of J 2 and one component of J because 
[J 2 , Ji] = 0. Without loss of generality we can orient our coordinates so that 
the chosen component of J is J z . Let us label a ket which is simultaneously 
an eigenstate of J 2 and J z as |/3,m), where 

J 2 \/3,m) = /3|/3,m) ; J z \/3,m) = m\/3,m). (7.1) 


J ± — Jx i i J'i 


V' 


We now define 


(7.2) 
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These objects clearly commute with J 2 , while their commutation relations 
with J z are 


[J+i Jz\ — \Jxi Jz\ T i \Jyi Jz\ — 1 Jy Jx — J+ , 

[J—, Jz\ = [-Iji Jz] i[*3y, = ^Jy T Jx — J-- 

Since J± commutes with J 2 , the kets J±|/3,ra) are eigenkets of J 2 with 
eigenvalue /3. Operating with J z on these kets we find 

J z J + \P,m) = {J+J z + [■ J Z ,J +]) |/3, m) = (m + l)J + |/3 ,to) ^ 

J z J-\j3,m) = ( J-J z + [J*, J_]) |/3 ,m) = (m - l)J_|/3,m). 

Thus, J + |/3,m) and J_|/3, m) are also members of the complete set of states 
that are eigenstates of both J 2 and J z , but their eigenvalues with respect to 
J z differ from that of |/3, m) by ±1. Therefore we may write 

J±\/3,m) = a±\/3,m±l), (7.5) 


where a± is a constant that we now evaluate. We do this by taking the 
length-squared of both sides of equation (7.5). Bearing in mind that J' + = 
J_, we find 

|a+| = (/3,m|J_J+|/3,m) = ( 0,m\(J x - iJ y ){J x +iJ y )\/3,m ) 

= (/3,m|(J 2 - J 2 - J z )|/3,m) = /3 - m(m + 1). 

Similarly, |a_| 2 = /3 — m(m — 1), so 

a± = \Jfi — m(m ± 1). (7.7) 

The Ji are Hermitian operators, so (ip\J?\ip) = |J r i|//’)| 2 > 0. Hence 

/3 = (f3, m\J 2 \/3, to) = (fi,m\(J x + J 2 + J 2 2 )|^,?n) > ?n 2 . (7.8) 

So notwithstanding equation (7.5), it cannot be possible to create states with 
ever larger eigenvalues of J z by repeated application of J + . All that can stop 
us doing this is the vanishing of a+ when we reach some maximum eigenvalue 
w max that from equation (7.7) satisfies 

/3 rn max (77i max T 1) = 0. (7-9) 


Similarly, must vanish for a smallest value of m that satisfies 

/3 n7mi n (m m i n 1) = 0. (7.10) 

Eliminating /3 between (7.9) and (7.10) we obtain a relation between m max 
and m m i n that we can treat as a quadratic equation for m m ; n . Solving this 
equation we find that 


m mi „ = i{l ± (2m max + 1)}. (7.11) 

The plus sign yields a value of ra m i n that is incompatible with our require¬ 
ment that TO m i„ < m max , so we must have m m i n = ^m ma x- To simplify the 
notation, we define j = m max , so that equation (7.9) becomes /3 = j(j + 1) 
and —j < m < j. Finally, we note that since an integer number of applica¬ 
tions of J- will take us from |/3 ,j) to |/3, — j), 2 j must be an integer - see 
Figure 7.1. In summary, the eigenvalues of J 2 arej(j + l) with 2 j = 0,1,2,... 
and for each value of j the eigenvalues m of J z are (j, j — 1,..., — j). 

At this point we simplify the labelling of kets by defining | j, m ) to be 
what has hitherto been denoted |/3,m) with /3 = j(j + 1) - we clear a great 
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- m=2 

m = 3/2 - 


m=—3/2 - 

1=3/2 


- m=-2 

1=2 


Figure 7.1 Going from m m [ n to 
m ma x in an integer number of steps 
in the cases j = 2. 


deal of clutter from the page by replacing \j(j +1), to) with | j, to). The kets’ 
eigenvalues with respect to J 2 are of course unaffected by this relabelling. 
Had we known at the outset that the eigenvalues of J 2 would be of the form 
j(j + 1), we would have used the new notation all along. 

In summary, we can find simultaneous eigenstates of J 2 and one of the 
Ji , conventionally taken to be J z . The eigenvalues of J 2 are j(j + 1) with 
2 j 7 = 0,1,..., and for any given j the eigenvalues to of J z then run from +j 
to —j in integer steps: 

j >m> - j . (7.12) 

In order to move from the state | j, to) to the adjacent state | j, to ± 1) we use 
the raising or lowering operators J± which act as 

= a±(m)\j,m±l) = \Jj(j + 1) - m(m ± l)|j,m ± 1). (7.13) 


These operators only change the J z eigenvalue, so they just realign a given 
amount of total angular momentum, placing more (</+) or less (</_) along 
the 2 -axis. So far, we have not discovered how to alter the J 2 eigenvalue 

j(j + !)• 

It is sometimes helpful to rewrite the constants a± (m ) of equation (7.13) 


in the form 


o+(to) 

a- (m) 


\j - m)(j + to + 1) 
\j + m)(j — to + 1). 


(7.14) 


These equations make it clear that the proportionality constants for different 
to satisfy 


a+(?n) = a+(—m — 1) a+(?n — 1) = «_(to) 

ot— (to) = oc— (—to T 1) «_(to) = a+(— to). 


(7.15) 


For example, when J_ lowers the highest state \j,j), we obtain the same 
proportionality constant as when J + raises the lowest state | j,—j); conse¬ 
quently, we only need to work out half the constants directly, because we can 
then infer the others. 

By expressing J x = \{J+ + J-) in terms of the ladder operators, we 
observe that when we apply J x to a state | j, to) for j > 0 we obtain a ket 
that differs from | j,m), so |j, to) is never an eigenket of J x ■ Hence for j > 0 
it is impossible to be certain what will be the result of measuring both J z 
and J x . It is trivial to see that this argument extends to the pair (J 2 , J y ), 
so for j > 0 it is impossible to be certain of the outcome of more than one of 
the J,;. If j = 0 the outcome measuring of any component of J is certainly 
0, but a null vector has no direction. Consequently, there are no states in 
which the vector J has a well-defined direction. This situation contrasts with 
the case of the momentum vector p, which can have a well defined direction 
because its components commute with one another. 

In §4.1.2 we discovered that when the system is rotated through an 
angle a around the 2 axis, its ket \ip) transforms to |^') = U(a )|/>), where 



142 


Chapter 7: Angular Momentum 


the unitary operator U(a) = exp(— \aJ z ). If | ip) = |j, m) is an eigenket of 
J z . U(a) simply changes its phase: 

U(a)\j,m) = e~ iaJz \j,m) = e~ iam \j,m). (7.16) 

Since 2 j is an integer, j (and hence in) must be either an integer or a half in¬ 
teger. Using this information in equation (7.16), we see that, after a rotation 
through 27 t around the 2 -axis, we have either 


|j, m) —> | j, m) for m even 


(7.17a) 


or 


\j, to) —> — | j,m) for m odd. 


(7.17b) 


Equation (7.17a) is as expected; under a 27 t rotation, the system returns to 
its original state. However, equation (7.17b) says that a system with half 
integer angular momentum does not return to its original state after a 27 t 
rotation - the initial and final states are minus one another! This difference of 
behaviour between systems with integer and half-integer angular momentum 
is of fundamental importance, and determines many other characteristics of 
these systems. A result of quantum field theory is that ‘spin-half’ fields 
never attain macroscopic values: the quantum uncertainty in the value of 
a spin-half field is always on the same order as the value of the field itself. 
Integer-spin fields, by contrast, can attain macroscopic values: values that 
are vastly greater than their quantum uncertainties. Consequently, classical 
physics - physics in the absence of quantum uncertainty - involves integer- 
spin fields (the electromagnetic and gravitational fields are examples) but no 
spin-half field. Our intuition about what happens when a system is rotated 
has grown out of our experience of classical physics, so we consider that 
things return to their original state after rotation by 27 t. If we had hands-on 
experience of spin-half objects, we would recognise that this is not generally 
true. 


7.1.1 Rotation spectra of diatomic molecules 

Knowledge of the spectrum of the angular momentum operators enables us 
to understand an important part of the dynamics of a diatomic molecule such 
as carbon monoxide. For some purposes a CO molecule can be considered 
to consist of two point masses, the nuclei of the oxygen and carbon atoms, 
joined by a ‘light rod’ provided by the electrons. In this model the molecule’s 
moment of inertia around the axis that joins the nuclei is negligible, while 
the same moment of inertia / applies to any perpendicular axis. 

In classical mechanics the rotational energy of a rigid body is 



where the A are the moments of inertia about the body’s three principal 
axes and J is the body’s angular-momentum vector. We conjecture that 
the equivalent formula links the Hamiltonian and the angular momentum 
operators in quantum mechanics: 



(7.19) 


The best justification for adopting this formula is that it leads us to results 
that are confirmed by experiments. 

In the case of an axisymmetric body, we orient our body such that 
the symmetry axis is parallel to the 2 axis. Then I = I x = I y and the 
Hamiltonian can be written 



(7.20) 
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From this formula and our knowledge of the eigenvalues of J 2 and J z , we 
can immediately write down the energies that form the spectrum of H: 


E- = _ 

4m 2 




(7.21) 


where j is the total angular-momentum quantum number and |m| < j. In 
the case of a diatomic molecule such as CO, I z <C I so the coefficient of m 2 is 
very much larger than the coefficient of j(j + 1) and states with \m\ > 0 will 
occur only far above the ground state. Consequently, the states of interest 
have energies of the form 


E j = j(j + l)|j. (7.22) 

For reasons that will emerge in §7.2.2, only integer values of j are allowed. 

CO is a significantly dipolar molecule. The carbon atom has a smaller 
share of the binding electrons than the oxygen atom, with the result that it 
is positively charged and the oxygen atom is negatively charged. A rotating 
electric dipole would be expected to emit electromagnetic radiation. Because 
we are in the quantum regime, the radiation emerges as photons which, as 
we shall see, can add or carry away only one unit h of angular momentum. 
It follows that the energies of the photons that can be emitted or absorbed 
by a rotating dipolar molecule are 

h 2 

E p = ± (Ej — Ej_ i) = ±J —. (7.23) 


Using the relation E = hv between the energy of a photon and the frequency 
v of its radiation, the frequencies in the rotation spectrum of the molecule 
are 



(7.24) 


In the case of 12 CO, the coefficient of j evaluates to 113.1724 GHz and spec¬ 
tral lines occur at multiples of this frequency (Figure 7.2). 

In the classical limit of large j, J = jh is the molecule’s angular mo¬ 
mentum, and this is related to the angular frequency w at which the molecule 
rotates by J = Iu. When in equation (7.24) we replace jh by Iuj, we dis¬ 
cover that the frequency of the emitted radiation v is simply the frequency 
w/27t at which the molecule rotates around its axis. This conclusion makes 
perfect sense physically. Now, because of the form of the Hamiltonian, the 
energy eigenstates are also the eigenstates of J z and J 2 . Therefore in any 
energy eigenstate, (J 2 ) = j{j + 1) and for low-lying states with m = 0 and 
j ~ 0(1), j(j + 1) is significantly larger than j 2 . Therefore Vj in (7.24) 
is smaller than the frequency at which the molecule rotates when it is in 
the upper state of the transition. On the other hand, Vj is larger than the 
rotation frequency \J(j — l)j^j of the lower state. Hence the frequency at 
which radiation emerges lies between the rotation frequencies of the upper 
and lower states. Again this makes sense physically. As we approach the 
classical regime, j becomes large so j(j + 1) ~ j 2 ~ (j — 1 )j and the rotation 
frequencies of the upper and lower states converge, from above and below, 
on the frequency of the emitted radiation. 

Measurements of radiation from 115 GHz and the first few multiples of 
this frequency provide one of the two most important probes of interstel¬ 
lar gas. 1 In denser, cooler regions, hydrogen atoms combine to form H 2 
molecules, which are bisynnnetric and do not have an electric dipole mo¬ 
ment when they are simply rotating. Consequently, these molecules, which 


1 The other key probe is the hyperfine line of atomic hydrogen that will be discussed 
in Chapter 8. 
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y/GHz 

Figure 7.2 The rotation spectrum of CO. The full lines show the measured frequencies 
for transitions up to j = 38 —> 37, while the dotted lines show integer multiples of the 
lowest measured frequency. Up to the line for j = 22 —t 21 the dotted lines are obscured 
by the full lines except at one frequency for which measurements are not available. For 
j > 22 the separation between the dotted and full lines increases steadily as a consequence 
of the centrifugal stretching of the bond between the molecule’s atoms. Measurements are 
lacking for several of the higher-frequency lines. 


together with similarly uncommunicative helium atoms make up the great 
majority of the mass of cold interstellar gas, lack readily observable spectral 
lines. Hence astronomers are obliged to study the cold interstellar medium 
through the rotation spectrum of the few parts in 10 6 of CO that it contains. 

Important information can be gleaned from the relative intensities of 
lines associated with different values of j in equation (7.24). The rate at 
which molecules emit radiation and thus the intensity of the line 2 is propor¬ 
tional to the number rij of molecules in the upper state. As we shall deduce 
in §7.5.3, all states have equal a priori probability, so rij is proportional to 
the number of states that have the given energy - the degeneracy or sta¬ 
tistical weight g of the energy level. From §7.1 we know that g = 2j + 1 
because this is the number of possible orientations of the angular momentum 
for quantum number j. 

In §6.4 we saw that when a gas is in thermal equilibrium at temper¬ 
ature T, the probability pj that a given molecule is in a state of energy 
Ej is proportional to the Boltzmann factor exp(— Ej/k^T), where ks is the 
Boltzmann constant (eq. 6.93a). Combining this proportionality with the 
dependence on the degeneracy 2 j + 1 just discussed leads us to expect that 
the intensity of the line at frequency vj will be 

Tj cx (2 j + 1) exp (-Ej/ksT) (j > 0). (7.25) 

For Ei < k'e.T, Ij increases at small j before declining as the Boltzmann 
factor begins to overwhelm the degeneracy factor. Fitting this formula, which 
has only one free parameter (T), to observed line intensities enables one both 
to measure the temperature of the gas, and to check the correctness of the 
degeneracy factor. 

Figure 7.2 shows that for large values of the quantum number j , the 
spacing between lines in the spectrum diminishes in apparent violation of the 
prediction of equation (7.24). Lines with large j are generated by molecules 
that are spinning very rapidly. The bond between the nuclei is stretched like 
a spring by the centripetal acceleration of the nuclei. Stretching of the bond 
increases the moment of inertia /, and from equation (7.24) this decreases 
the frequency of the spectral lines (Problem 7.2). 


2 We neglect the absorption of photons after emission, which can actually be an im¬ 
portant process, especially for 12 CO. 
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7.2 Orbital angular momentum 

Let x and p be the position and momentum operators of the system. Then, 
inspired by classical mechanics, we define the dimensionless orbital angular 
momentum operators by 3 


L = 


ixxp, 


that is Li 


^ e ijkXjPk- 
jk 


(7.26) 


From the rules of Table 2.1 and the Hermitian nature of x and p, the Her- 
mitian adjoint of Lj is 

L\ = — e ijkp\x\ = ^ tijkXjPk = Li, (7.27) 

jk jk 


where we have used the fact that [. Xj,pk ] = 0 for j ^ k. Thus the Li are 
Hermitian and are likely to correspond to observables. We also define the 

total orbital angular momentum operator by 


L 2 eeL-L = L 2 +L 2 +L 2 , (7.28) 

which is again Hermitian, and calculate a number of commutators. First, 
bearing in mind the canonical commutation relation (2.54), we have 


[Li, X[[ — ^ y ^ C/y/,: [.f'jPf.. X/[ ^ y ' eijkXj \pkt XI ] 1 y ) ejjlXj 

jk jk j 

= i y ' eujXj. 
j 


(7.29) 


Similarly 


[Li,Pi\ 


1 

h 


y ' Ojk [XjPk 5 Pl\ 
jk 


1 

h 


y^jk[xj,pi] Pk 

jk 


= iJ2eujPj. 
j 


(7.30) 


Notice that these commutation relations differ from the corresponding ones 
for Ji [equations (4.30) and (4.32)] only by the substitution of L for J. From 
these relations we can show that Li commutes with the scalars x 2 , p 2 and 
x • p. For example 


[Lj.,p ] y [ [Lj,Pj] i y [ tjjkjpkPj T PjPk ) — 0, (7.31) 

j jk 

where the last equality follows because the e symbol is antisymmetric in jk 
while the bracket is symmetrical in these indices (see also Problem 7.3). We 
can now also calculate the commutator of one component of L with another. 
We have 


[L x , L y \ = ~[L X , ( zp x - xp z )\ = i(—yp x + xp y ) = i L z . (7.32) 

Clearly each Lj commutes with itself, and the other non-zero commutators 
can be obtained from equation (7.32) by permuting indices. These commu¬ 
tators mirror the commutators (7.104) of the Jj. 

L is a vector operator by virtue of the way it is constructed out of the 
vectors x and p. It follows that L 2 is a scalar operator. Hence the way these 

3 In many texts L is defined without the factor h ~ 1 . By making L dimensionless, this 
factor simplifies many subsequent formulae. 
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operators commute with the total angular momentum operators Jj follows 
from the work of §4.2: 

[Ji, Lj] = Cijk L k ; [J,:,i 2 ]=0. (7.33) 

k 

Although p 2 and x 2 commute with Lj, the total angular momentum 
operator J 2 does not: 

[J 2 , Lj] = ]T[J 2 , Lj] = i ]T e jik (L k Jj + JjL k ). (7.34) 

3 jk 

The right side does not vanish because the final bracket is not symmetric in 
jk. The physical significance of [J 2 ,Lj] being non-zero is that if our system 
is in a state of well-defined total angular momentum, in general there will 
be uncertainty in the amount of orbital angular momentum it has about any 
axis. We shall explore the consequences of this fact in §7.5. 


7.2.1 L as the generator of circular translations 

In §4.1.1 we saw that when the system is displaced along a vector a, its ket 
is transformed by the unitary operator 17(a) = e~ la ' p ^ h . We now imagine 
successively performing n translations through vectors {ai, a 2 ..., a„}. Since 
each translation will cause | if)) to be acted on by a unitary operator, the final 
state will be 


U(a n )... t/(a 2 )17(ai)|?/>) = jjjexp ' P^jlV’) 


= exp 




(7.35) 


where the second equality follows because the components of p commute 
with one another. Since the exponent in the last line is proportional to the 
overall displacement vector A = Y^ii= l the change in \i/j) is independent 
of the path that the system takes. In particular, if the path is closed, A = 0 
and | ip) is unchanged. 

Now consider the effect of moving the system in a circle centred on the 
origin and in the plane with unit normal n. When we increment the rotation 
angle a by Sa, we move the system through 


(5a = 8a n x x. 

The associated unitary operator is 


i 


17(5a) = exp ——5a (n x x) • p = expl ——5a n • (x x p) 

IS 11 fl 


= e 


h 

— i<5a; n-L 


(7.36) 


(7.37) 


The unitary operator corresponding to rotation through a finite angle a is a 
high power of this operator. Since the exponent contains only one operator, 
n-L, which inevitably commutes with itself, the product of the exponentials 
is simply 

17(a) = e _ia L , (7.38) 

where a = an. 

The difference between the total and orbital angular momentum oper¬ 
ators is now apparent. When we rotate the system on a turntable through 
an angle a, the system’s ket is updated by e -1 “' J . When we move the sys¬ 
tem around a circle without modifying its orientation, the ket is updated 
by e”‘“' L . The crucial insight is that the turntable both moves the system 
around a circle and reorientates it. The transformations of which J is the 
generator reflects both of these actions. The transformations of which L is 
the generator reflects only the translation. 
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Figure 7.3 J both swings the particle around the origin and rotates its spin (left), while 
L moves the particle, but leaves the direction of the spin invariant (right). 


7.2.2 Spectra of L 2 and L z 

We have shown that the L, commute with one another in exactly the same 
way that the Ji do. In §7.1 we found the possible eigenvalues of J 2 and J z 
from the commutation relations and nothing else. Hence we can without 
further ado conclude that the possible eigenvalues of L 2 and L z are l(l + 1) 
and m, respectively, with —l<m< l, where l is a member of the set 

(oil® 1 

In the last subsection we saw that L is the generator of translations 
on circles around the origin, and we demonstrated that when a complete 
rotation through 2-7T is made, the unitary operator that L generates is simply 
the identity. Consider the case in which we move the system right around the 
z axis when it is in the eigenstate |/, to) of L 2 and L z . The unitary operator 
is then e~ 2nlLz and the transformed ket is 

1 1, to) = e - 27riL * 1 1, to) = e- 2m7ri | l, to). (7.39) 

Since the exponential on the right side is equal to unity only for integer to, 
we conclude that L z , unlike J z has only integer eigenvalues. Since for given 
l , m runs from —l to /, it follows that l also takes only integer values. Thus 
the spectrum of L 2 is l(l + 1) with l = 0,1,2,..., and for given l the possible 
values of L z are the integers in the range (—1,1). 


7.2.3 Orbital angular momentum eigenfunctions 

We already know the possible eigenvalues of the operators L 2 and L z . Now 
we find the corresponding eigenfunctions. 

In the position representation, the Li become differential operators. For 
example 

L, = \[xp y -yp,) = -i C-40) 


Let (r, 9, 0) be standard spherical polar coordinates. Then the chain rule 
states that 


d dx d dy d dz d 

dcj) d(j> dx d(j) dy d(f)dz' 

Using x = r sin 9 cos <f>, y = r sin 9 sin 0 and z = r cos 9 we find 


(7.41) 


— = r sin 9 
d(p 


, d d 

- sin 0— + cos 0— 
dx dy 


X d~y- y Tx 


= i L z 


(7.42) 


That is 

= - l w < M3 » 

Let 1 1, to) be a simultaneous eigenket of L 2 and to. for the eigenvalues 1(1 + 1) 
and to, respectively. Then L z \l, to) = m\l, to) and the wavefunction 0; m (x) = 
(x|Z,to) must satisfy the eigenvalue equation 

. dlplm 
d(f> 


m^lm- 


(7.44) 
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The solution of this equation is 

*Pim(r,0,<t>) = K lm (r,d)e im *, (7.45) 


where Ki m is an arbitrary function of r and 0. Since m is an integer, is 
a single-valued function of position. 

In our determination of the spectra of J 2 and J z in §7.1, important roles 
were played by the ladder operators J± = ( J x ± iJ y ). If we define 


L ± = L x ±iL y , (7.46) 

then by analogy with equation (7.5) we will have that 

L±\l, m) = a±\l, m ± 1), (7.47a) 

where _ 

a±(m ) = \Jl{l + 1) - m(m ± 1). (7.47b) 

It will be helpful to express L± in terms of partial derivatives with re¬ 
spect to spherical polar coordinates. We start by deriving a relation between 
partial derivatives that we will subsequently require. From the chain rule we 
have that 

8 ( 8 8 \ 8 
— = r cos 01 cos 6— —I- sin 6— — r sin0—. (7.48a) 

88 \ ox dy) oz 

Multiplying the corresponding expression (7.42) for <f> by cot0 yields 


cot 0-—- = r cos 0 ( — sin 

8q> 


& 


COS (j) 


dy 


(7.48b) 


Adding or subtracting i times (7.48b) to (7.48a) we obtain 


d 


8 


88 


8<j> 


8 


8 


— ± i cot 0— = r cos 0 ^ (cos <j) =F i sin <j>) — + (sin 4> ± i cos <j>) — 


dy 


( d (J \ cf 

—— ± i— J — rsin0—. 
8x 8v / 8z 


8 


8 


Mx dy 


Multiplying through by e ±lc ^, we obtain the needed relation: 

e±l4 (w ± ico " 9 -k) = ri “"(s =*= %) - r si " 6 


r sin 0— 
8z 


(7.49) 

(7.50) 


With this expression in hand we set to work on L + . In the position repre¬ 
sentation it is 


. . „ . .8 8 
L + = - l \yiT- z 77:.) + \ z 77Z^ x 7C 


= z( 


8z 
8 . 8 
\dx dy 


dy 

) - (x + i y) 


8x 

8_ 

8z 


8z 


= r cos d(-^~ + i-^-) — r sinde 1 ^ 
\8x dy) 


8z 


so with equation (7.50) we can write 

8 


L-l. — 6 


\88 




a0 + icot %y 


(7.51) 


(7.52a) 


.(8 8 \ (8 8 \ 
L - = -'Vd-z-%)-[ z oi- x a;) 

(8 . 8 \ 8 

= ~ 


(7.52b) 


Similarly 
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Table 7.1 The first six spherical harmonics 



The state |/, /) with the largest permissible value of m for given l must 
satisfy the equation L + \l,l) = 0. Using equations (7.45) and (7.52a), in the 
position representation this reads 


dKu 

dO 


l cot OKu — 0. 


(7.53) 


This is a first-order linear differential equation. Its integrating factor is 
exp (—l f dO cot 0) = sin - * 6 , so its solution is Ku = R(r) sin* 0, where R is 
an arbitrary function. Substituting this form of Ku into equation (7.45), we 
conclude that 

tpu (r, 0 , (j>) = R(r) sin* 0 e 1 *'*’. (7-54) 

From equation (7.54) we can obtain the wavefunctions ipim of states with 
smaller values of m simply by applying the differential operator L_. For 
example 


= constant x R(r)e~ lc/> ( - — + icotfl—) sin'fle 1 * 0 

\ dO d(j>J (7.55) 

= constant x R(r) sin* -1 OcosOe I< -* -1 7‘*’. 

Hence, the eigenfunctions of L 2 and L z for given l all have the same radial 
dependence, R(r). The function of 0,(j> that multiplies R in ipi m is conven¬ 
tionally denoted Y™ and called a spherical harmonic. The normalisation 
of YJ” is chosen such that 


J d 2 H|YH 2 = l with cl 2 fl = sin 0 dOdtp (7.56) 

the element of solid angle. We have shown that 

Y\ oc sin* 6e il ^ and Y ; * -1 a sin* -1 0cos0e i( * _1)< ^. (7.57) 

The normalising constants can be determined by first evaluating the integral 

/ d 2 n sin 2 * 0 = 4tt 2 2 * / (7.58) 

J (2Z + 1)! V ; 

involved in the normalisation of Y / ? and then dividing by the factor of 
equation (7.47b) each time L_ is applied. 

The spherical harmonics YJ n for l < 2 are listed in Table 7.1. Figures 7.4 
and 7.5 show contour plots of several spherical harmonics. Since spherical 
harmonics are functions on the unit sphere, the figures show a series of 
balls with contours drawn on them. Since spherical harmonics are complex 
functions we had to decide whether to show the real part, the imaginary 
part, the modulus or the phase of the function. We decided it was most 
instructive to plot contours on which the real part is constant; when the real 
part is positive, the contour is full, and when it is negative, the contour is 
dotted. 
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Figure 7.4 Contours of 5ft(YJi) on the unit sphere for m = 15 (left), m = 7 (centre) and 
m = 2 (right). The contours on which ;, R() = 0 are the heavy curves, while contours 
on which 5R(Y^) < 0 are dotted. Contours of the imaginary part of YY 1 would look the 
same except shifted in azimuth by half the distance between the heavy curves of constant 
azimuth. 



Figure 7.5 Top row: contours of ;, R(Y"' ) for in = 1 (left) and 0 (right) with line styles 
having the same meaning as in Figure 7.4. Contours of the imaginary part of Y j would 
look the same as the left panel but with the circles centred on the y axis. Bottom row: 
contours of 5ft(Y™) for m = 2 (left), m = 1 (centre) and m = 0 (right). 


For large l, Y\ is significantly non-zero only where sin (9 ~ 1, i.e., around 
the equator, 9 = 7r/2 - the leftmost panel of Figure 7.4 illustrates this case. 
The first l applications of L_ each introduce a term that contains one less 
power of sin 9 and an extra power of cos 9. Consequently, as m diminishes 
from l to zero, the region of the sphere in which Y ; m is significantly non-zero 
gradually spreads from the equator toward the poles - compare the leftmost 
and rightmost panels of Figure 7.4. These facts make good sense physically: 
Yj is the wavefunction of a particle that has essentially all its orbital angular 
momentum parallel to the z axis, so the particle should not stray far from the 
xy plane. Hence Yj, the amplitude to find the particle at 9, should be small 
for 9 significantly different from tt/2. As to diminishes the orbital plane is 
becoming more inclined to the xy plane, so we are likely to find the particle 
further and further from the plane. This is why Y[™ increases away from the 
equator as m decreases. 

For large l the phase of Y\ changes rapidly with <j> (leftmost panel of 
Figure 7.4). This is to be expected, because the particle’s large orbital 
angular momentum, IU , implies that the particle has a substantial tangential 
motion within the xy plane. From classical physics we estimate its tangential 
momentum at p = ITi/r , and from quantum mechanics we know that this 
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implies that the wavefunction must change its phase at a rate p/Ti = l/r 
radians per unit distance. This estimate agrees precisely with the rate of 
change of phase with distance around the equator arising from the factor e' 1 ^ 
in Y\. When m is significantly smaller than l (rightmost panel of Figure 7.4), 
the rate of change of the wavefunction’s phase with increasing </> is smaller 
because the particle’s tangential momentum is not all in the direction of 
increasing (j). Hence Y ; m oc e lm A 

For any value of to, L x and L y both have zero expectation values, as 
follows immediately from the relation L x = \(L + + L_). So the orientation 
of the component of the angular momentum vector that lies in the xy plane 
is completely uncertain. Because of this uncertainty, the modulus of 
is independent of <f>, so there is no trace of an inclined orbital plane when 
to < l. An orbital plane becomes defined if there is some uncertainty in L z , 
with the result that there are non-zero amplitudes ip m = (l,m\ip) for several 
values of to. In this case quantum interference between states of well-defined 
L z can generate a peak in |(x|^>)| 2 along a great circle that is inclined to the 
equator. 


7.2.4 Orbital angular momentum and parity 

In §4.1.4 we defined the parity operator P, which turns a state with wave- 
function ?/>(x) into the state that has wavefunction We now 

show that wavefunctions that are proportional to a spherical harmonic YJ n 
are eigenfunctions of P with eigenvalue (—1)*. 

In polar coordinates the transformation x —> —x is effected by 8 —> tt — 0, 
<f> <j>+ 7r. Under this mapping, sind = sin(7r — 8) is unchanged, while e lZ< ^ —> 
e U7r e U0 = By equation (7.57), Y\ oc sin*0e u *, so Y\ (-l)'Yf. 

That is, Y ; has even parity if l is an even number and odd parity otherwise. 

In §4.1.4 we saw that x and p are odd-parity operators: Px = —xP. 
From this and the fact that the orbital angular momentum operators Li 
are sums of products of a component of x and a component of p, it follows 
that both the Li and the ladder operators L± = L x ± i L y are even-parity 
operators. Now Y™^ 1 = L_Y ; m /a_, where a_ is a constant, so applying 
the parity operator 

PY!" 1 = —L_PY\ = {-l) l —LJY\ = (-l^Yf- 1 . (7.59) 

GL— GL— 

That is, Yj -1 has the same parity as Y l t . Since all the Y[" for a given l can 
be obtained by repeated application of L_ to Yf, it follows that they all have 
the same parity, (—1) ; . 


7.2.5 Orbital angular momentum and kinetic energy 

We now derive a very useful decomposition of the kinetic energy operator 
Hk = p 2 /2m into a sum of operators for the radial and tangential kinetic 
energies. First we show that L 2 is intimately related to the Laplacian opera¬ 
tor V 2 . From the definition (7.46) of the ladder operators for orbital angular 
momentum, we have 


— ( L x + i Ly){L x — iL y ) — L 2 + L 2 + i [L y , L x \ 
= L 2 + L 2 + L z . 


(7.60) 


Hence with equations (7.52) we may write 


L 2 = L+L_ - L z + L 2 z 


d . d 

m +lcot % 


= e 






d_ 

88 


icot 8 


d(j>) 


. 8 8 2 

1 8(j> d(j > 2 ' 



152 


Chapter 7: Angular Momentum 


Differentiating out the right side 


L 2 


d 2 2 <9 2 „/ d 3 \ 

ap~ cot e a4? +cote {~aS + ‘ cote dd) 


d_ 

d<j>s 


— 1CSC 


d_ ,d__ 

d<j> 1 d<j) 


cP_ 

d(f> 2 


The first-order terms in d/d(f> cancel because cot 2 6 — esc 2 9 = — 1. This 
identity also enables us to combine the double derivatives in </>. Finally the 
single and double derivatives in 6 can be combined so that the equation 
becomes 

= (76I> 

which we recognise as — r 2 times the angular part of the Laplacian operator 
V 2 . 

Now we ask “what is the operator associated with radial momentum?”. 
The obvious candidate is rp, where r is the unit vector in the radial direction. 
Unfortunately this operator is not Hermitian: 

(r-p) f = p-r ^r-p, (7.62) 


so it is not an observable. This is a particular case of a general phenomenon: 
the product AB of two non-commuting observables A and B is never Her¬ 
mitian. But it is easy to see that \{AB + BA) is Hermitian. So we define 


Pr = a( ? -P + P-r) 


(7.63) 


which is manifestly Hermitian. We will need an expression for p r in the 
position representation. Replacing p by —i?iV we have 


ih f 1 _ _ , . , 

p r = -r • V + V • (r/r) 


From the chain rule it is straightforward to show that 


d 


d 


r dr X dx 


V 


d_ 

dy 


d 

z — =r V . 

dz 


Moreover, V • r = 3, so equation (7.64) can be rewritten 

ih 


Pr = - 


2 

= — ih 


d_ c 
dr + r 
d_ 1 
dr r 


r 

77 


d_ 

dr 


This expression enables us to find the commutator 

d 


[r,Pr] = -ifr 


r, 


dr 


= ih. 


Squaring both sides of equation (7.66) yields 


2 *2 
Pr = -ri 


d_ 1 
dr + r 


d_ 

dr 


= -r 


d 2 2d 


dr 2 


+ 


dr 


- o T 


tf_d_ 

r 2 dr 


,d_ 

dr 


(7.64) 


(7.65) 


(7.66) 


(7.67) 


(7.68) 


We recognise this operator as — h 2 times the radial part of the Laplacian op¬ 
erator V 2 . Since we have shown that L 2 is —r 2 times the angular part of the 
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Table 7.2 The first five Legendre polynomials 



l 

0 12 3 4 


1 p \(3p 2 — 1) i(5/x 3 -3p) |(35/u 4 — 30p 2 + 3) 


Laplacian (eq. 7.61), it follows that V 2 = — (p 2 /h 2 + L 2 /r 2 ). Consequently, 
the kinetic-energy operator Hk = p 2 / 2 m = — (h 2 /2m)V 2 can be written 

Hk= 2^. {f r + "r 2 ) ' (7 ' 69) 


The physical interpretation of this equation is clear: classically, the orbital 
angular momentum hh is mr x v = mrvt, where Vt. is the tangential speed, 
so the term h 2 L 2 /2mr 2 = \mv 2 is the kinetic energy associated with the 
tangential motion. On the other hand p 2 /2m = \mv 2 , so this term repre¬ 
sents the kinetic energy due to radial motion, as we would expect. For future 
reference we note that the kinetic-energy operator can be also written 


2m \ r 2 dr \ dr J r 2 / 


(7.70) 


7.2.6 Legendre polynomials 

The spherical harmonic Y° is special in that it is a function of 9 only. We 
now show that it is, in fact, a polynomial in cos 9. In the interval 0 < 9 < 
7T of interest, 9 is a monotone function of p = cosf?, so without any loss 
of generality we may take Y ; ° to be proportional to a function Pi{p). On 
this understanding, Pi is an eigenfunction of L 2 with eigenvalue l(l + 1). 
Transforming the independent variable from 9 to p in our expression (7.61) 
for L 2 , we find that Pi must satisfy Legendre’s equation: 

— (^(1 - p 2 )-j-^^) + + 1)P; = 0. (7-71) 

We look for polynomial solutions of this equation. Putting in the trial solu¬ 
tion Pi = J2n bnP n , we find 

b n {n(n — 1 )p n ~ 2 — n(n + 1 )p n + 1(1 + 1 )p n } = 0. (7.72) 


This equation must be valid for any value of p in the interval (—1,1), which 
will be possible only if the coefficient of each and every power of p individually 
vanishes. The coefficient of p k is 

0 = bk+ 2 (k + 2)(7c + 1) — bk {k[k + 1) — 1(1 + 1)} . (7.73) 

For k = 0 the expression connects 62 to 60 , while for k = 2 it relates 64 to 62 , 
and so on. Thus from this equation we can express b n as a multiple of bo for 
even n, and as a multiple of b± for odd n. Moreover, if l is an even number, 
we know from our discussion of parity that Pi must be an even function 
of p , so in this case b n must vanish for n odd. Finally, b n will vanish for 
n even and greater than l on account of the vanishing of the curly bracket 
in equation (7.73) when k = l. This completes the proof that for even l, 
Pi{p) is a polynomial or order l. An extremely similar argument shows that 
Pi(p) is also a polynomial of order l when l is odd. The first five Legendre 
polynomials are listed in Table 7.2. 
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The conventional normalisation of the Legendre polynomial Pi is the 
requirement that P/( 1) = 1. With this property, the Pi are not orthonormal. 
In fact 

J d/x Pi(p)Pv(p) = ' ( 7 -74) 

From this result it easily follows that the proportionality constant between 
Pi(cos9) and the orthonormal functions Y 9(6) is such that 


Y ?(0) 


21 + 1 
47r 


Pi (cos 9). 


(7.75) 


7.3 Three-dimensional harmonic oscillator 

In this section we discuss the dynamics of a particle that moves in three 
dimensions subject to a central force that is proportional to the particle’s 
distance from the origin. So the Hamiltonian is 

H = + lmuj 2 r 2 . (7.76) 

2m z 

If we use Cartesian coordinates, this Hamiltonian becomes the sum of three 
copies of the Hamiltonian of the one-dimensional harmonic oscillator that 
was the subject of §3.1: 


H = H X + H y + H z , (7.77) 

where, for example, H x = (p 2 /2m)+^mco 2 x 2 . These one-dimensional Hamil¬ 
tonians commute with one another. So there is a complete set of mutual 
eigenkets. Let \n x , n y , n z ) be the state that is an eigenket of H x with eigen¬ 
value ( n x + ^)hu> eq. 3.12, etc. Then | n x ,n y ,n z ) will be an eigenket of the 
three-dimensional Hamiltonian (7.76) with eigenvalue 

E = [n x + n y + n z + §)?ku. (7.78) 

Moreover, in the position representation the wavefunction of this state is just 
a product of three of the wavefunctions we derived for stationary states of a 
one-dimensional oscillator 


tHx) = u nx {x)u ny (y)u nz {z). 


(7.79) 


In view of these considerations it might be thought that there is nothing 
we do not know about the Hamiltonian (7.76). However, it is instructive 
to reanalyse the system from a more physical point of view, that recognises 
that the system is spherically symmetric. We have seen that [Li,p 2 ] = 0, 
and [Li,?’ 2 ] = 0, so [Li, H] = 0 and [L 2 ,H] = 0. From this result it follows 
that there is a complete set of mutual eigenstates of H, L 2 and L z . Very 
few of the eigenstates obtained from the one-dimensional Hamiltonians are 
eigenstates of either L 2 or L z . We now show how the eigenvalue problem 
associated with (7.76) can be solved in a way that yields mutual eigenkets of 
H , L 2 and L z . This exercise is instructive in itself, and some technology that 
we will develop along the way will prove extremely useful when we analyse 
the hydrogen atom in Chapter 8. 

We use equation (7.69) to eliminate p 2 from equation (7.76) 


2r2 


H = 


Pr | U L 
2m 2 mr 2 


+ \mia 2 r 2 . 


(7.80) 
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Figure 7.6 The effective potential 
(7.82) for (from bottom to top) 
l = 0,..., 5. The beads mark the 
classical turning points at the values 
of energy and angular momentum 
that quantum mechanics allows. 


We can assume that our energy eigenstates are also eigenstates of L 2 , so in 
this Hamiltonian we can replace L 2 by an eigenvalue 1(1 + 1). Hence we wish 
to find the eigenvalues of the radial Hamiltonian 


pI | i(i +i)h 2 

2m 2 mr 2 


1 2 2 
±mcu r . 


(7.81) 


This is the Hamiltonian for a particle that moves in a one-dimensional ef¬ 
fective potential 


V eS (r) = * (Z 2 *y 2 + § m.oj 2 r 2 . (7.82) 

Hi governs the oscillations of the mass about the minima of this potential, 
which is plotted in Figure 7.6. The eigenkets \E,l,m) of H are products of 
the eigenkets \E,l) of Hi and the eigenkets | l,m) of L 2 and L z : 

\E,l,m) = \E,l)\l,m). (7.83) 

Our determination of the allowed energies of a one-dimensional har¬ 
monic oscillator exploited the dimensionless operators A and A', which 
rather nearly factorise H/Tiu>. So here we define the operator 


A t = 


(i +1 )h 

i p r -b muir 


y/ 2mhui 

The product of A and its Hermitian adjoint Al, is 


A] At = 


1 


2mhui 

1 

2mTiuj 

1 


-i Pr - 


(l + 1 )h 


+ muir i p r — 


, Pr + “ 


(l + l)h 


rV 


+ muir + i 
r ) 


([+ m 

r 

(i + i)h 


i(i +i)h 2 


+ m 2 ui 2 r 2 — (21 + 3 )hr 


2mhui 

Comparing the right side with equation (7.81), we see that 
Hi = hui + (l + §)) , 


(7.84) 


muir 


■ muir , p r 


(7.85) 

(7.86) 


which bears a strong similarity to equation (3.3) for the one-dinrensional 
harmonic oscillator. 
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The commutator of Ai and is 


[AuA\\ — 


1 


2 mhui 
i 

mhui 


1 Pr 


Pr, 


(l + l)h 

r 

(i + i)h 


+ muir 


- 1 Pr 


(l + l)h 


0I + i)fi , , 

2 ' ^ 5 

mwr z 

(7.87) 

where we have used equation (7.67) to reach the last line. This result can be 
written more usefully in the form 


[A l ,A\\= Hl+1 n ^ R] + 1. (7.88) 

From this expression and equation (7.86) we can easily calculate the com¬ 
mutator of Hi with Ap. 


[- Ai,Hi] = Uuj[Ai,A]Ai} = hoj[Ai,A]]Ai = (H l+t - H t + Hu) At. (7.89) 

Now let | E, l ) be an eigenket of Hi with eigenvalue E: 


H l \E,l) = E\E,l). (7.90) 

We multiply both sides of the equation by Ai and use equation (7.89) to 
reverse the order of Ai and Hp. 

EAi\E,l) = AiHi\E,l) = (HiAi + [A h Hi])\E,l) 

= (Hi + i + hui)Ai\E,l). 

On rearrangement this yields 

Hi+ 1 (A t | E, l)) = (E- hu)(At | E, l)), (7.92) 

which says that Ai\E, l} is an eigenket of i7/+i for the eigenvalue E — Tiui, so 

At\E,l)=a-\E-hu,l + l), (7.93) 

where is a normalising constant. 

Ai creates the radial wavefunction for a state that has more orbital 
angular momentum and less energy than the state with which it started. 
That is, Ai diminishes the radial kinetic energy by some amount and adds a 
smaller quantity of energy to the tangential motion. If we repeat this process 
a sufficient number of times, by following Ai with Ai + \ and Ai + \ with Ai + 2 , 
and so on, there will come a point at which no radial kinetic energy remains 
- we will have reached the quantum equivalent of a circular orbit. The next 
application of Ai must annihilate the wavefunction. Hence Ac\E,C) = 0, 
where C(E) is the largest allowed value of l for energy E. If we operate on 
\E,C) with He, we find with equation (7.86) that 

E\E,C) = H c \E,C) = hu(£ + %)\E,£), (7.94) 

SO 

E = (C + | )hu> and C(E) = -jp -— §• 

Tiuj 

Since £ is a non-negative integer, it follows that the ground-state energy is 
|/ioj and that the ground state has no angular momentum. In general E/hu 
is any integer plus |. These values of the allowed energies agree perfectly 
with what we could have deduced by treating H as a sum of three one- 
dimensional harmonic-oscillator Hamiltonians. 
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Figure 7.7 Radial probability distributions of circular orbits in the three-dimensional 
harmonic oscillator potential for C = 1 and C = 8. The scale radius £ = \JTi/2mui. 


We shall define a circular orbit to be one that has the maximum an¬ 
gular momentum possible at its energy. We obtain the radial wavefunctions 
of these by writing the equation Ac\E, C) = 0 in the position representation. 
With equations (7.84) and (7.66) this equation becomes 


d 1 C +1 mu . . 

— H-h — r u c (r) = 0 . 

or r r n 


(7.95) 


This is a first-order linear differential equation. Its integrating 



so the solution of equation (7.95) is 

uc(t) = constant x r c e~ r , where £ = y/%/2mu. 


factor is 

(7.96) 


(7.97) 


Notice that the exponential factor is simply the product of three exponential 
factors from equation (3.15), one in x, one in y and one in z. The wavefunc- 
tion varies with r, so a circular orbit does have some radial kinetic energy. 
In the limit of large C in which classical mechanics applies, the radial kinetic 
energy is negligible compared to the tangential kinetic energy, and we neglect 
it. But it never really vanishes. 

Equation (7.97) gives the radial wavefunction for a circular orbit. The 
complete wavefunction is ?/>(x) = uc(r)Y™{Q, </>), and since f d 2 fIYJ n = 1 , 
the radial probability density is P(r) = r 2 u 2 c oc r 2C + 2 e ~ r / 2f - ; where the 
factor r 2 arises from the expression for the volume element d 3 x in spherical 
polar coordinates. This density is plotted in Figure 7.7 for C = 1 and C = 8. 
For r I £ < C \/2C + 2, P rises as r 2£+2 . For r/£ > y/2L + 2 it falls rapidly as 
the Gaussian factor takes over. Figure 7.7 shows that the uncertainty in r is 
~ £, which is a small fraction of r when C is not small. 

We may obtain the radial wavefunctions of more eccentric orbits by 
showing that A\ is a raising operator. Equation (7.89) yields 


AiHi — Hi + iAi + huA t 


(7.98) 


Daggering both sides we have 


HiA\ — A\Hi + \ + huAj. 


( 7 . 99 ) 
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Figure 7.8 The (E, l ) plane and the 
action of Ai and A] j. 


We now multiply both sides of Hi +1 \E,l + 1} = E\E,l + 1} by Aj: 

EA]\E,l + 1) = A}H l+1 \E,l + 1) = (Ht-hu)A]\E,l + l). (7.100) 

Rearranging 


Hi(Aj\E,l + l)) = {E + hw){A}\E,l + l)). (7.101) 

Thus, we have shown that 

Aj\E,l + l) = a + \E + hu>, l), (7.102) 

where a+ is a normalising constant. By writing Aj in the position repre¬ 
sentation, we can generate the wavefunctions of all non-circular orbits by 
repeatedly applying Aj to the current wavefunction, starting with that of 
a circular orbit. We start with the product of r c and a Gaussian factor 
[equation (7.97)]. From this A^ c _ 1 generates terms proportional to r c+l and 
r c ~ l times the Gaussian (Problem 7.25). From these two terms A]-_ 2 then 
generates three terms, r c+2 , r c and r c ~ 2 times the Gaussian, and so on. 
Consequently the number of radial nodes - radii at which the wavefunction 
vanishes - increases by one with each application of Aj, and the wavefunc¬ 
tion oscillates more and more rapidly in radius as Aj invests a larger and 
larger fraction of the particle’s kinetic energy in radial motion. 

Figure 7.8 helps to organise the generation of radial wavefunctions. Each 
dot represents a radial wavefunction. From the dot at ( E , /), operating with 
Ai carries one to the next dot up and to the left, while operating with Aj_ 1 
carries one to the next dot down and to the right. At half the energies 
only even values of l occur, and only odd values of l occur at the other 
half of the energies. In Problem 7.22 you can show that, when one bears in 
mind that each dot gives rise to 21 + 1 complete wavefunctions, the number 
of wavefunction with energy E that we obtain in this way agrees with the 
number that we would obtain using wavefunctions of the one-dinrensional 
harmonic oscillator via equation (7.79). 


7.4 Spin angular momentum 

In §7.2.1 we saw that the difference between J and L is that J is the gener¬ 
ator for complete rotations of the system, while L is the generator for dis¬ 
placements of the system around circles, while leaving its orientation fixed 
(Figure 7.3). Consequently the difference 

S = J — L (7.103) 

is the generator for changes of orientation that are not accompanied by any 
motion of the system as a whole. Since J and L are vector operators, S is 
also a vector operator. Its components are called the spin operators. 
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We saw in §7.2 that L has exactly the same commutation relations as 
J with any function of the position and momentum operators only. From 
this fact and the definition (7.103), it follows that S commutes with all such 
functions. In particular [S,x] = [S,p] = [S,L] = 0. This essentially tells us 
that S has nothing to do with a system’s location, nor the way in which it 
may or may not be moving. S is associated with intrinsic properties of the 
system. 

The components S- t of the spin operator inherit the usual angular mo¬ 
mentum commutation rules from Ji and L,: 

[S;. Sj] = [Ji - Li, J) - Lj] 

[Jit Jj\ [Lit Jj\ [Jit Lj\ T [Li, Lj[ 

— 1 'z ' C'ijk (Jk Lk Lk -\~ Lk) (7.104) 

k 

= 1 'z ' CijkSk- 
k 

We define S 2 = S • S and then equation (7.104) ensures that 


[S,5' 2 ]=0. (7.105) 

Because the S) have exactly the same form of commutation relations as the 
Ji, we know that the possible eigenvalues of S 2 are the numbers 0, 1, |,.., 

and that for given s the eigenvalues m of the Si move in integer steps from —s 
to s. Can s take half-integer values? This question is answered affirmatively 
by equation (7.103); since [ J Z ,L Z ] = 0 we can find a complete set of states 
that simultaneously have well-defined values of both J z and L z . In general, 
the J z eigenvalue could be either an integer or half-integer, whereas the L z 
eigenvalue must be an integer. The difference S z = J z — L z must then be 
either an integer or half-integer. 

In the rest of this book we will make extensive use of commutation 
relations involving angular momentum operators. In Table 7.3 these have 
been gathered for later reference. 


7.4.1 Spin and orientation 

We have several times stated without proof that the orientation of the system 
is encoded in the amplitudes aj m for the system to be found in states of well 
defined angular momentum, \j,m). We now begin to justify this claim. For 
simplicity we consider spin angular momentum because we want to focus on 
the orientation of our system without concerning ourselves with its location. 
However, what we refer to as ‘spin’ is the total intrinsic angular momentum 
of the system. If the latter is a hydrogen atom, for example, it may contain a 
contribution from the orbital angular momentum of the electron in addition 
to the contributions from the intrinsic spins of the electron and the proton. 

Since the S) are Hermitian operators, any state \ip) may be expanded 
in terms of the complete set of eigenstates |s,m) of, say, S z and S 2 . We 
have seen that these states are labelled by an integer or half integer s, with 
—s < m < s, so the complete expansion is 

S 

W)= {s,m\t()}\s,m). (7.106) 


Fortunately, systems for which quantum mechanical effects are significant 
rarely have more than a handful of non-zero amplitudes (s, ?n|i/’) in the sum 
of equation (7.106). In the simplest case we have an object with s = 0, a 
spin-zero object, such a pion. The sum in equation (7.106) contains only 
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Because there are only two terms in this expansion, the quantum uncertainty 
in the orientation of a spin-half system is very great. We shall see that the 
most precise information we can have is that the end of the system’s angular 
momentum vector lies in a given hemisphere - for example, we could state 
that it lies within the northern rather than the southern hemisphere, or the 
western rather than the eastern hemisphere. Where it lies in the hemisphere 
is shrouded in quantum uncertainty. 

Another important class of systems contains those that have total spin 
quantum number s = 1. These systems are called spin-one objects. The W 
and Z bosons fall in this class. For a spin-one system, the expansion (7.106) 
reduces to just (2s + 1) = 3 terms. For example, the state of a Z boson can 
be written 

l 

|Z)= 53 <l,m|Z)|l,m). (7.109) 

m— — 1 

We will see that we can constrain the end of the angular-momentum vector 
of a spin-one system to lie within a chosen polar cap, or in the equatorial 
band that lies between opposite polar caps. 

The larger a system’s spin s, the more precisely we can constrain the 
end of its angular momentum vector. It is rather as if systems were subject 
to random torques of a certain magnitude, and the faster it is spinning, 
the more stable its orientation can be in the face of the random torques. 
The same physical principle underlies the use of rifling in guns to stabilise 
the orientation of the projectile by imparting angular momentum to it as 
it flies down the barrel. A few concrete examples will clarify the physical 
interpretation of the quantum states |s,m). 


7.4.2 Spin-half systems 


As in equation (7.108), the state of any spin-half system may be expanded 
in terms of just two S z eigenstates \\,+\) and \\,—\) which we will call 
|+) and |—) respectively. Equation (7.108) then reads \tp) = a|+) + b \—). In 
this basis we can write the operators as (cf. equation 2.16) 5 


S T . 


G+l-S*|+> (+\S X \-)\ . q _f(+\Sy\+) <+l^|->\ 

V <-1-5x1+) VHS„|+> <-15,1 —)) 


C-(( + \Sz\+) (+\Sz\~)\ 
Z ~\(-\S Z \+) <—|5z|—} ) ’ 


(7.110) 


The elements of the matrix S z are trivially evaluated because |±) are the 
eigenkets of S z with eigenvalues ±|. To evaluate the other two matrices we 
notice that S x = ^(S + + S-), and S y = 5 -(5 + — S_), then use the relations 
S+|—) = |+) and SL|+) = |—) which follow from equations (7.5) and (7.7) 
for the spin operator. The result of these operations is 


O—l 

J oc — 2 


0 1 
1 0 


O—I 

°V — 2 


0 -i 
i 0 


5 — I ( 1 0 

z “ 2 \ 0 -1 


(7.111) 


The matrices appearing here are the Pauli matrices, 



so we can write S = ^cr. It is straightforward to verify that the 
any Pauli matrix is the identity matrix: 


(7.112) 
square of 


o\ = I. (7.113) 

This result implies that for any state = (S^) = ( S z ) = which is 

consistent with the fact that the measurement of any component of S can 
produce only ± 5 . 

The Stern—Gerlach experiment In 1922, Stern and Gerlach 6 conducted 


5 Here we are again slightly abusing the notation; Si are taken to be both the spin 
operators and their matrix representations. 

6 Gerlach, W. & Stern, O., 1922, Zeit. f. Physik , 9, 349 
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Figure 7.9 Schematic of a Stern- 
Gerlach filter. The atomic beam en¬ 
ters from the left. Between the pole 
pieces the magnetic field increases 
in intensity upwards, so atoms that 
have their spins aligned with B are 
deflected upwards and the other 
atoms are deflected downwards. 


oven 



Figure 7.10 Beam split by an SG 
filter and then up beam hits a sec¬ 
ond filter. 


some experiments with silver atoms that most beautifully illustrate the de¬ 
gree to which one can know the orientation of a spin-half object. In addi¬ 
tion to this interest, these experiments provide clear examples of the stan¬ 
dard procedure for extracting experimental predictions from the formalism 
of quantum mechanics. 

A silver atom is a spin-half object and has a magnetic dipole moment 
/Li. which can be used to track the atom’s orientation. In a magnetic field 
B, a magnetic dipole experiences a force V(/u • B). Consequently, in a field 
that varies in strength with position, a dipole that is oriented parallel to B 
is drawn in to the region of enhanced |B|, whereas one that is antiparallel 
to B is repelled from this region. Stern and Gerlacli exploited this effect to 
construct filters along the lines sketched in Figure 7.9. A powerful magnet 
has one pole sharpened to a knife edge while the other forms either a flat 
surface (as shown) or is slightly concave. With this geometry the magnetic 
field lines are close packed as they stream out of the knife edge, and then 
fan out as they approach the flat pole-piece. Consequently the intensity of 
the magnetic field increases towards the knife edge and the Stern—Gerlach 
filter sorts particles according to the orientation of their magnetic moments 
with respect to B. 

The experiments all start with a beam of sliver atoms moving in vacuo, 
which is produced by allowing vapourised silver to escape from an oven 
through a suitable arrangement of holes - see Figure 7.10. When the beam 
passes into a filter, FI, it splits into just two beams of equal intensity. We 
explain this phenomenon by arguing that the operator /.q associated with 
the i th component of an atom’s magnetic moment is proportional to Sp 
Pi = gSi. Hence the filter has ‘measured’ n • S, where n is the unit vector in 
the direction of B; we are at liberty to orient our coordinate system so that 
n = e z , and n • S = S z . We know that for a spin-half system, a measurement 
of S z can yield only ±i, so the splitting of the beam into two is explained. 
Given that there was nothing in the apparatus for producing the beam that 
favoured up over down as a direction for p, it is to be expected that half of 
the atoms return +i and half — so the two sub-beams have equal intensity. 
We block the sub-beam associated with S z = — \ so that only particles with 
S z = \ emerge from the filter. 

We now place a second Stern-Gerlach filter, F2, in the path of the |+) 
sub-beam, as shown in Figure 7.10, and investigate the effect of rotating the 
filter’s magnetic axis n in the plane perpendicular to the incoming beam’s 
direction. Let this be the yz plane. The incoming particles are definitely in 
the state 7 |+, z) because they’ve just reported on a measurement of S z . 


7 We relabel |+) -4 +, z) to make clear that this is a state with spin up along the 
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F2 measures n • S, where n = (0, sin#, cos 9) with 9 the direction between 
n and the z-axis. If |+,0) is the eigenket of n • S with eigenvalue + |, 
the amplitude that the measurement yields +\ is (+, #|+, z). The defining 
equation of |+, 9) is |n-cr|+, 9) = ||+, 9) or, using the matrix representation 
(7.111) 

( cos 9 —i sin 9\ ( a\ ( a \ 11 , 

-cose) (i) = (")’ < 7 ' 114 > 

where a = (+, z|+, 0) and b = (— ,z\+,9). We have to solve this equation 
subject to the normalisation condition |a| 2 + |6| 2 = 1. From the first row of 
the matrix we deduce that 


b . 1 — cos 9 
a sin# 


(7.115) 


From the trigonometric double-angle formulae we have 1 — cos 9 = 2 sin 2 1 9 
and sin 9 = 2 sin i# cos \6, so 


b sin b 9 

— = i- j — 

a cos ±9. 


(7.116) 


The choices 

( a \ _ f cos|# \ 

\b) ~ 


(7.117) 


satisfy both equation (7.116) and the normalisation condition. The ampli¬ 
tude that a particle with spin up along the z-axis also has spin up along the 
n-axis is a* = (+, #|+, z), so the probability that an atom will pass F2 is 


Pi = H 2 = cos 2 \9. 


(7.118) 


Thus, as 9 is increased from 0 to ir, the fraction of atoms that get through 
F2 declines from unity to zero, becoming 1 when 9 = 7r/2 and the magnetic 
axes of FI and F2 are at right angles. Physically it would be surprising if 
the fraction that passed F2 when 9 = 7t/ 2 were not a half since, when the 
magnetic moments of incoming atoms are perpendicular to the magnetic axis 
of a filter, there is nothing in the geometry of the experiment to favour the 
outgoing particles being parallel to the magnetic axis, rather than antipar¬ 
allel. When 9 = tt the magnetic axes of the filters are antiparallel and it is 
obvious that every atom passed by FI must be blocked by F2. This agrees 
with what we found out about a spin-half object’s orientation in the previous 
section; if it is pointing somewhere in the upper z hemisphere, then there is 
some chance it is also pointing in any other hemisphere apart from the 
one. 

We now place a third filter, F3, in the atomic beam that emerges from 
F2. Let (j> denote the angle between the magnetic axis of this filter and 
the z-axis. The atoms that emerge from F2 are in the state |+,0) because 
they’ve just returned ) on a measurement of n • S, so the amplitude that 
these atoms get through F3 is (+, <j>\+, 9). The amplitudes a' = (+, z|+, </>) 
and b' = (—,z \+,<fi) can be obtained directly from the formula we already 
have for (a, b) with <\> substituted for 9. Hence 


( a '\ _ ( cos !</> \ 

(b'J ~ ■ 

and the amplitude to pass F3 is 

(+,4>\+,9) = {+,(j>\s,z){s,z\+,9) 

S = ± 



= cos — 9). 


(7.119) 


(7.120) 


2 - axis. 
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Thus the amplitude to pass F3 depends only on the angle <f> — 0 between 
the magnetic axes of the filters, and the probability of passing F3 could 
have been obtained simply by substituting this angle into equation (7.118). 
This conclusion is obvious physically, but it is satisfying to see it emerge 
automatically from the formalism. 

An especially interesting case is when 0 = 7 t/2 and cf> = ir. In the absence 
of F2, F3 would now block every atom that passed FI. But with F2 present 
both F2 and F3 allow through half of the atoms that reach them, so a quarter 
of the atoms that leave FI with S z = +i pass both filters. These atoms exit 
from F3 with S z = — i. Introducing F2 changes the fraction of atoms that 
pass F3 because the measurement that F2 makes changes the states of the 
atoms. This is a recurring theme in quantum mechanics. No measurement 
can be made without slightly disturbing the system that is being measured, 
and if the system is small enough, the disturbance caused by a measurement 
can significantly affect the system’s dynamics. 


7.4.3 Spin-one systems 

In the case that s = 1, three values of m are possible, —1,0,1, and so the 
Si may be represented by 3 x 3 matrices. The calculation of these matrices 
proceeds exactly as for spin half, the main difference being that (7.5) and 
(7.7) now yield 


5+|-l) = v ' 2 | 0 ) ; S+| 0 ) = V 2 |l) ; 
5_|l) = v ' 2 | 0 ) ; 5_|0) = v ' 2 |-l) . 


The result is 


S x = 




s z 


( 1 0 

0 0 

\0 0 



(7.121) 


(7.122) 


Consider the effect of using Stern-Gerlach filters on a beam of spin- 
one atoms. In the experiment depicted in Figure 7.10 each filter now splits 
the incoming beam into three sub-beams, and we block all but the beam 
associated with m = +1 along the magnetic axis. One third of the atoms 
that emerge from the collimating slits get through the first filter FI because 
each value of m is equally probable at this stage . 8 To calculate the fraction 
of atoms which then pass through F2, the magnetic axis of which is inclined 
at angle 0 to that of FI, we must calculate the amplitude (l,0|l,z). The 
defining equation of |1,0) is n • S| 1, 0} = |1, 0), which with equations (7.122) 
can be written 


/ cos 0 
72 sin 0 

V 0 

where a = (1, z\l, 0), b = (0, z\l, 0), and c = (—1, z\l, 0). The first and third 
equations, respectively, yield 


-72 sin 0 


72 sin 0 


-72 sm 

— COS# 



(7.123) 


(cos # — 1 )& = —— sin # b and —— 
v 2 v 2 


sin# b = (1 + cos #)c. 


(7.124) 


8 The atoms emerge from the slits in an impure state (§6.3) and we set the probabilities 
for each value of m to 1 in order to maximise the Shannon entropy of that state (§6.3.2). 
The equality of the probabilities is an instance of the general principle that every quantum 
state has equal a priori probability. 
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Eliminating a and c in favour of b yields 


/ o\ 


(:) 

= b 



/~75 c °t(0/2)\ 

V 72tan(6>/2) / 


(7.125) 


The normalisation condition |a| 2 + |6| 2 + |c| 2 = 1 now implies that b = 
V / 2sin(i6 l ) cos( 50 ). The coefficient that we need is therefore 


(1, 9\l, z) = a = icos 2 (|0) = 5(1 + cos 9). (7.126) 


Hence the probability that an atom passes F2 after passing FI falls from unity 
when 9 = 0 to zero when 9 = n as we would expect. When 9 = ir/2 the 
probability is P 3 = j, which is substantially smaller than the corresponding 
probability of i found in (7.118) for the case of spin-half atoms. 

From a classical point of view it is surprising that after FI has selected 
atoms that have their angular momentum oriented parallel to the 3-axis 
(in the sense that S z takes the largest allowed value) there is a non-zero 
probability P 3 that the angular momentum is subsequently found to be, in 
the same sense, aligned with the y axis. The explanation of this phenomenon 
is that for this system, the value of S 2 is s(s+l) = 2 which is twice the largest 
allowed value of S z . Hence, even in the state |1,3) a significant component 
of the angular momentum lies in the xy plane. P 3 is the probability that 
this component is found to be parallel to the y axis. Once the measurement 
of S y has been made by F3, the atom is no longer in the state |1 ,z) and we 
are no longer certain to obtain 1 if we remeasure S z . 


7.4.4 The classical limit 

An electric motor that is, say, 1 cm in diameter and weighs about 10 grn 
might spin at 100 revolutions per second. Its angular momentum would then 
be ~ 10 _ 3 kgm 2 s _1 , which is ~ 10 31 7i. Thus classical physics works with 
extremely large values of the integers s, m. It is interesting to understand 
how familiar phenomena emerge from the quantum formalism when s is no 
longer small. 

For any value of s we can construct matrices that represent the angular 
momentum operators. The matrix for S z is diagonal with the eigenvalues 
s, (s—1),..., — s down the diagonal. The matrices for S x and S y are evaluated 
in the usual way from S+ and S— and so are zero apart from strips one 
place above and below the diagonal. Using the relations (7.15) between the 
coefficients a±(m) of the raising and lowering operators S± we then find 

0 0 \ 

0 0 1 


0 ce(s — 2) 0 

a(s — 2) 0 a(s — 1) 

0 a(s — 1) 0 / 


{ 0 a(s — 1) 0 

a(s — 1) 0 a(s — 2) 

0 a(s - 2) 0 


S x = 


0 

V 0 


= i + «(m - l)5 m , n+ i 


(7.127a) 
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Figure 7.11 The points show the absolute values of the amplitudes a m = (s, in, z\s, s, 8) 
for s = 40 and, from left to right, 9 = 120°, 80°, 30°. For each value of 8, the vertical line 
shows the value of cos 9. 



Sy 


1 

2i 


( ° 

— a(s — 1) 

0 


a(s — 1) 0 

0 a{s - 2) 

—a(s - 2) 0 


1 

2i 


0 

V o 

[ - a(m)t 


0 

0 

a(m - l)<5 m , n+ xj 


0 0 \ 

0 0 I 


0 a(s - 2) 0 

—a(s — 2) 0 a(s — 1) 

0 -a(s-l) 0 / 
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/s 0 ... 0 ... 0 0 \ 

0 s — 1 0 0 


5, 


0 


TO 


0 


00 l-s 0 

Vo 0 ... 0 ... 0 -s/ 

— TO 


(7.127c) 


where the a(m) are what were called a+{m) in (7.15), and the rows and 
columns of the matrix are labelled from +s at the top left to — s at the 
bottom right. In the same way as for spins s = ^ and s = 1, it is straight¬ 
forward (for a computer) to determine the amplitudes a m = ( s,m,z\s,s,9) 
for measuring S z to have value to, given that n • S certainly returns value s 
when n = (0, sind, cos 9) is inclined at angle 9 to the 2 -axis. The points in 
Figure 7.11 show the results of this calculation with s = 40 and three values 
of 6. The amplitudes peak around the values of the ordinate, m = scosO, 
that are marked with vertical lines. The larger the spin, the more sharply the 
amplitudes are peaked around these lines, so for the extremely large values of 
s that are characteristic of macroscopic systems, a m is significantly non-zero 
only when to differs negligibly from scos 9. Hence, in the classical limit the 
only values of S z that have non-negligible probabilities of occurring lie in 
a narrow range around s cos 9, which is just the projection of the classical 
angular-momentum vector (S) = sn onto the 2 -axis. That is, in the classical 
limit the probability of measuring any individual value of S z is small, but we 
are certain to find a value that lies close to the value predicted by classical 
physics. 
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The classical picture implies that when the angular-momentum vector is 
tipped in the yz plane at angle 9 to the z axis, the value of S y should be s sin 9. 
So now for the eigenstate of n • S just described, we evaluate the expectation 
value (S y ) by first multiplying the matrix for S y on the column vector of 
the amplitudes plotted in Figure 7.11, and then multiplying the resulting 
vector by the row vector of the complex conjugates of the amplitudes. The 
expectation value of S y in this state is 


(&y) — 


= ^7 ^2 a *m (a(m)5 mi n -1 + a(m - l)<5 m ,„+i) < 


(7.128) 


~ — ^ (a(m)a* m a m+ \ + a(m - l)a* m a m -\), 


bearing in mind that a(s) = a(—s — 1) = 0. For a given value of 9, the 
amplitudes plotted in Figure 7.11 lie on smooth curves so we can use the 
approximation |a m _i| ~ \a m \ ~ |a TO +i|. The phases of the a m increase by 
7 t /2 with successive values of to, so (S y ) is real and the two terms (7.128) 
add. Finally, we exploit the fact that \a m \ is small unless m ~ scosd and 
use the approximation for large s, to that 

a(m) = \Js(s + 1) — to(to + 1) ~ \/s 2 — to 2 ~ s sin 61 (7.129) 

Combining these approximations with the normalisation condition on the a m 
gives 

(S y ) ~ ssin# E |a m | 2 = ssind (7.130) 

m 

exactly as classical physics leads us to expect. 

To determine the uncertainty in S y we evaluate the expectation of Sy. 
From equation (7.127b) we find that the matrix Sy has elements 

S 

- 3 E { a ( m )^rn,p-i + a(m - l)(5m,p+i) (a(p)S p<n -i + a(p - l)<5p, n +i^ 

p——s 

— {a(m)a(m + l)i5 m , n _ 2 + a(m - 1 )a(m - 3)S mtn+2 

- (a 2 (to) + a 2 (to - 1)) S mn } 

(7.131) 

where in going to the second line we have ignored corrections when m = ±s 
because the amplitudes for these are negligible anyway. Using the same 
approximations as before, we now find 


<s„ 2 > = E<( s » 2 ) 


mn&n 


mn 


^2a 2 (m)\a m \ 2 ~ (ssind ) 2 ^ |a m | 2 

m m 

s 2 sin 2 9 ~ (S y ) 2 ■ 


(7.132) 


The uncertainty in S y , being ~ ( (S 2 ) — (S y ) 2 ) X ^ 2 is therefore negligible. A 
similar calculation shows that both (S x ) and ( S . 2 ) vanish to good accuracy. 
Thus in the classical limit it is normal for all three components of S to have 
small uncertainties. However, it should be noted that S y can be accurately 
determined precisely because there is some uncertainty in S z : our calculation 
on (S y ) depends crucially on there being several non-zero amplitudes a m . 
Quantum interference between states with different values of S z is responsible 
for confining the likely values of S y to a narrow range. 

This is the third time we have found that the familiar world re-emerges 
through quantum interference between states in which some observable has 



168 


Chapter 7: Angular Momentum 


well-defined values: in §2.3.3 we found that bullets can be assigned posi¬ 
tions and momenta simultaneously through interference between states of 
well-defined momentum, in §3.2 we saw that an excited oscillator moves as 
a result of quantum interference between states of well-defined energy, and 
now we find that a gyro has a well defined orientation through quantum 
interference between states of well-defined angular momentum. In the clas¬ 
sical regime a tiny fractional uncertainty in the value of an observable allows 
the vast numbers of states to have non-negligible amplitudes, and interfer¬ 
ence between these states narrowly defines the value of the variable that is 
canonically conjugate to the observable (§2.3.1). 

7.4.5 Precession in a magnetic field 

A compass needle swings to the Earth’s magnetic north pole because a mag¬ 
netic dipole such as a compass needle experiences a torque when placed in a 
magnetic field. Similarly, a proton that is in a magnetic field experiences a 
torque because it is a magnetic dipole. However, its response to this torque 
differs from that of a compass needle because it is a spinning body; instead of 
aligning with the magnetic field, it precesses around the field. This precession 
forms the basis for nuclear magnetic resonance (NMR) imaging, which has 
become an enormously important diagnostic tool for chemistry and medicine. 
The theory of NMR is a fine example of the practical application of quantum 
mechanics in general and spin operators in particular. 

Classically, the potential energy of a magnetic dipole ft in a magnetic 
field B is 

H = -n- B, (7.133) 

where the minus sign ensures that a dipole aligns with the field because this 
is its lowest-energy configuration. We align our coordinate system such that 
the z axis lies along B and assume that the magnetic moment operator p 
is a constant 2p p times the spin operator s. Then the Hamiltonian operator 
can be written 

H = -2p p Bs z . (7.134) 

The stationary states of this Hamiltonian are the eigenstates of s z , which for a 
spin-half particle such as a proton are the states |±) in which a measurement 
of s z is certain to yield ±|; the energies of these states are 


E± = T H P B. (7.135) 

The evolution in time of any spin state is 

\^,t) = a.e- iE -^ h \~) + a + e~ iE + t / R \+), (7.136) 

where the constant amplitudes a± specify the initial condition |^>, 0) = 
a_|—) + a+|+). 

Suppose that initially a measurement of the spin parallel to n = (sin 9, 0, cos 9) 
was certain to yield i. Then from Problem 7.6 we have that a_ = sin(0/2) 
and a+ = cos(0/2). Hence at time t the proton’s state is 

IVb t) = sin(0/2)e- iS -^|-} + cos(9/2)e- iE ^ n \+) 

7.137a 

= sin(0/2)e^ /2 |—) + cos(0/2)e-^ /2 |+), 

where 

<t>(t) = where u> = — . (7.137b) 

n n 

But from Problem 7.6 this is just the state |+,n') in which a measurement 
of the spin parallel to n' = (sin d cos <(>, sin^sin^cos#) is certain to yield 
Consequently, the direction in which a measurement of the spin is certain to 
yield \ rotates around the direction of B at the frequency u>. This mirrors 
the behaviour expected in classical physics of a magnetic dipole of magnitude 
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/i p that has spin angular momentum its spin axis would precess around 
B at angular frequency u> (Problem 7.12). 

When material that contains chemically bound hydrogen atoms is im¬ 
mersed in a powerful magnetic field, most of the protons align their spins 
with B in order to minimise their energy. Radiation of frequency oj has 
just the energy required kick a proton into the higher-energy state in which 
its spin is anti-aligned with B. Consequently, such radiation is readily ab¬ 
sorbed by a sample, whereas radiation of neighbouring frequencies is not. 
As the analysis above shows, quantum interference between the aligned and 
anti-aligned states causes the expectation value of the magnetic moment to 
precesses at angular frequency u>, and the precessing magnetic moment cou¬ 
ples resonantly to the imposed radiation field. 

The magnetic field at the location of a proton in a molecule has a con¬ 
tribution from the spins of the electrons that bind the proton, and this 
contribution varies slightly from one location to another. For example, in 
methanol, CH 3 OH, the magnetic field experienced by the proton that is at¬ 
tached to the oxygen atom differs from those experienced by the protons that 
are attached to the carbon atom, and the proton that is on the other side 
of the carbon atom from the oxygen atom experiences a different field from 
the protons that are adjacent to the oxygen atom. Since the frequency u> of 
the resonant radiation is proportional to the magnitude of magnetic field at 
the location of the proton, methanol has three different resonant frequencies 
for a given magnitude of the imposed magnetic field. Consequently, clues to 
the chemical structure of a substance can be obtained by determining the 
frequencies at which magnetic resonance occurs in a given imposed field. 


7.5 Addition of angular momenta 

In practical applications of quantum mechanics we can often identify two 
or more components of the system that each carry a well defined amount 
of angular momentum. For example, in a hydrogen atom both the proton 
and the electron carry angular momentum by virtue of their spins, and 
a further quantity of angular momentum may be present in the orbit of the 
electron around the proton. The total angular momentum of the atom is 
the sum of these three contributions, so it is important to understand how 
to add angular momenta in quantum mechanics. Once we understand how 
to add two contributions, we’ll be able to add any number of contributions, 
because we can add the third contribution to the result of adding the first 
two, and so on. Therefore in this section we focus the problem of adding the 
angular momenta of two ‘gyros’, that is two systems that have unvarying total 
angular momentum quantum number j but several possible orientations. 

Imagine that we have two gyros in a box and that we know that the first 
gyro has total angular-momentum quantum number j \, while the second gyro 
has total quantum number j 2 - Without loss of generality we may assume 
j 1 > j 2 . A ket describing the state of the first gyro is of the form 


h 

1^1}= 5Z Cm \ (7.138a) 

m=-j 1 

while the state of the second is 

32 

1^2)= 5Z d m\j2,m), (7.138b) 

m=-j 2 


and from the discussion in § 6.1 it follows that the state of the box is 


IV’) = |'0l)IV’2)- 


( 7 . 139 ) 
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The coefficients c m and d rn are the amplitudes to find the individual gyros 
in particular orientations with respect to the 2 axis. For example, if both 
gyros are maximally aligned with the z axis, we will have || = \dj 2 \ = 1 
and c mi = d m2 = 0 for mi £ j\ and m 2 ^ ji- 

The operators of interest are the operators J 2 , J iz and J i± of the ?' th 
gyro and the corresponding operators of the box. The operators J z and J± 
for the box are simply sums of the corresponding operators for the gyros 

Jz = J\z + Jiz ; J± — Ji± + J‘2± ■ (7.140) 

Operators belonging to different systems always commute, so [Ju, Jij) = 0 
for any values of i,j. The operator for the square of the box’s angular 
momentum is 

J 2 = (Ji + J 2 ) 2 = J 2 + J 2 + 2J!.J 2 . (7.141) 

Now 


J 1 +J 1 - — ( J\x + lJ\y){Jlx ‘^Jly ) (J 142) 

— (J'lx J‘2x A JlyJly) A l(JlyJlx JlxJly)- 

The expression for Ji_ J 2+ can be obtained by swapping the labels 1 and 2, 
so 9 

Ji+J 2 _ + J 1 -J 2 + A ‘iJizJiz = 2Ji.J 2 . (7.143) 

Using this expression to eliminate Ji.J 2 from (7.141) we obtain 

J 2 — J 2 A J 2 A J\+ Ji— A J 1 -J 2 + + 2 Ji z J 2z . (7.144) 

While the total angular momenta of the individual gyros are fixed, that 
of the box is variable because it depends on the mutual orientation of the 
two gyros: if the latter are parallel, the squared angular momentum in the 
box might be expected to have quantum number j\ + j 2 , while if they are 
antiparallel, the box’s angular momentum might be expected to have quan¬ 
tum number j\ — j 2 . We shall show that this conjecture is true by explicitly 
calculating the values of the coefficients c m and d m for which the box is in an 
eigenstate of both J 2 and J z . We start by examining the state |ji, ji) 
in which both gyros are maximally aligned with the 2 axis. It is easy to see 
that this object is an eigenket of J z with eigenvalue ji + j 2 . We use (7.144) 
to show that it is also an eigenket of J 2 : 

J 2 \ji,ji)\ji,h) = {J\ + Ji + J\+Ji~ + J 1 -J 1 + + 2Ji z Jiz)\3i,jr)\ji,3i) 

= {Ji (ji + 1) +ji{ji + 1) + 2jij 2 }|ji, ji)\ji, ji), 

(7.145) 

where we have used the equation Ji+\ji,ji) = 0, which follows from equation 
(7.7). It is straightforward to show that the expression in curly brackets in 
equation (7.145) equals j(j + l) with j = ji+j 2 . Hence \jx,j-i)\ji,ji) satisfies 
both the defining equations of the state |ji +j 2 , ji + j 2 ) and we may write 

lii + ji , ji +ji) = Ui . ji)\ji , ji) • (7-146) 

Now that we have found one mutual eigenket for the box of J 2 and J z 
we can easily find others by applying J_ to reorient the angular momentum 
of the box away from the 2 axis. Again setting j = j% + j 2 we evaluate the 
two sides of the equation 

J-\j,j) = (Ji- + Ji-)\ji,ji)\ji,ji)- (7-147) 

Equation (7.7) enables us to rewrite the left side 

J-\j,j) = Vdti + 1 )~j(j ^ 1) \j,j - 1) = V2j \j,j - I)- (7.148) 


9 Recall that Ju commutes with J2j for all ij. 
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Figure 7.12 The left panel shows states obtained by adding a system of angular momen¬ 
tum j2 = 1 to one with j\ = 2 , while the right panel is for j i = 1 and ]2= y • 


The right side of (7.147) becomes 


x/jiCn +!) - ii(ii - i)lii ,h - i>|j2,j'2) 

+ Vh(h + 1 ) - h(h - l)|ji, ji)|i 2 ,h - 1 ) (7.149) 

= V / 2ji \ji,ji ~ l)b‘2,J2> + V^h\ji,ji)\j2,j2 - !)• 


Putting the two sides back together, we have 


1 3,3 - !) = \ — ljl.il - l)|j2,J2> + \ —\jl,jl)\h,j2 - 1). 


(7.150) 


A further application of J_ to the left side of this equation and of J 1 _ + J 2 _ 
to the right side would produce an expression for | j, j — 2) and so on. 

Figure 7.12 helps to organise the results of this calculation. States of the 
box with well defined angular momentum are marked by dots. The radius 
of each semi-circle is proportional to j' , where j'(j' + 1 ) is the eigenvalue of 
the kets with respect to J 2 . The height of each ket above the centre of the 
circles is proportional to m. The left panel shows the case j\ = 2, j 2 = 1, 
while the right panel is for j\ = 1, j 2 = -1. The scheme for constructing 
eigenstates J 2 and J z that we have developed so far starts with the state at 
the top and then uses to successively generate the states that lie on the 
outermost semi-circle. 

We now seek an expression for the state | j — 1, j — 1) that lies at the top 
of the first semicircle inwards. It is trivial to verify that \j\, mi)\j 2 , m 2 ) is an 
eigenket of J z with eigenvalue (?ni +TO 2 ). We require m\ +m 2 = j\ +j 2 — 1, 
so either mi = j 1 — 1 and m 2 = j 2 , or m\ = j\ and m 2 = j 2 — 1 . Equation 
(7.150) shows that \j,j — 1) involves precisely these two cases, and must be 
orthogonal to |j — 1 , j — 1) because it has a different eigenvalue with respect 
to J 2 . So the ket we seek is the unique (up to an overall phase factor) linear 
combination of the kets appearing in (7.150) that is orthogonal to the linear 
combination that appears there. That is, 


Ij-l.j- 1 ) = \ — Iji.ji - I)|j2,j2) - \ — |jl,jl)|j2,j2 - I)- (7.151) 


All the kets | j — 1, to) for m = j — 2,..., which in Figure 7.12 lie on the first 
semicircle in, can be constructed by applying </_ to this equation. 

Similarly, \j — 2, j — 2), which in Figure 7.12 lies at the top of the smallest 
semicircle, will be a linear combination of | j \, j 1 — 2 ) \j 2 , j' 2 ), | ji, ji — 1 ) | j 2 , j 2 — 
1 ) and |ii, Ji>|j 2 , J 2 — 2 ) and must be orthogonal to |j, j- 2 ) and |j-l, j- 2 ), 
which are known linear combinations of these states. Hence we can determine 
which linear combination is required for | j — 2,j — 2 ), and then generate the 
remaining kets of the series | j — 2 ,?tt.) by applying to it. 

On physical grounds we would expect the box’s smallest total angular 
momentum quantum number to be j\ — j 2 , corresponding to the case in 
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Table 7.4 

Total numbers of states 

3 

Number of states 

jl + 32 

2(ji + 32 ) + 1 

ji +32-1 

2(ji + 32 ) + 1 — 2 

jl - 32 

2(ji + J 2 ) + 1 - 4 j 2 

Total 

(2ji + 1)(2 j 2 + 1) 


h 



Figure 7.13 Interpretation of Clebsch- 

Gordan coefficients in terms of vec- _ 

tors. The full line has length >/3(3 + 1) 
and its vertical component has length 
2. The dotted lines labelled ji have 
length a/ 2(2 + 1) and vertical com¬ 
ponents of length 2 and 1. 


which the two gyros are antiparallel (recall that we have labelled the gyros 
such that j\ > j 2 ). Does this conjectured smallest value of j allow for the 
correct number of basis states for the box? That is, will there be as many 
basis states of the box as there are of the contents of the box? We can easily 
evaluate the latter: there are 2ji + 1 orientations of the first gyro, and for 
each of these orientations, the second gyro can be oriented in 2j 2 + 1 ways. 
So the box’s contents can be in (2ji + 1)(2 j 2 + 1) basis states of the form 
Ui,mi}|j 2 ,m 2 ). The predicted number of basis states of the box is worked 
out in Table 7.4. In the main part of the table, the number of states in each 
row is two less than in the row above and there are 2 j 2 + 1 rows. The sum 
at the bottom can be obtained by making a third column that is just the 
second column in reverse order and noting that the sum of the numbers in 
the second and third columns of a given row is then always 4j-j + 2. Hence 
twice the sum of the numbers in the second column is 2 j 2 + 1 times 4ji + 2. 
Thus we do get the correct number of basis states if the smallest value of j 
is ji ~ J 2 - 

The numbers 


C{j,m;ji,j 2 ,m 1 ,m 2 ) = (j, m\ji, mi)\j 2 , m 2 ) (7.152) 

that we have been evaluating are called Clebsch Gordan coefficients. They 
have a simple physical interpretation: C(j, to; ji,j 2 , mi, to 2 ) is the amplitude 
that, on opening the box when it’s in a state of well defined angular momen¬ 
tum, we will find the first and second gyros to be oriented with amounts TOi 
and to 2 of their spins parallel to the 2 axis. For example, equation (7.151) 
implies that C(3,2; 2,1,1,1) = y^2/3, so if a box that contains a spin-two 
gyro and a spin-one gyro has spin-three, there is a probability 2/3 that on 
opening the box the second gyro will be maximally aligned with the z axis 
and the second significantly inclined, and only a probability 1/3 of finding 
the reverse arrangement. These two possibilities are depicted by the lower 
and upper dotted lines in Figure 7.13. The classical interpretation is that 
the two gyros precess around the fixed angular-momentum vector of the box, 
and that the two configurations for which the Clebsch-Gordan coefficients 
give amplitudes are two of the states through which the precession carries the 
system. This picture is intuitive and of some value, but should not be taken 
too seriously. For one thing, the rules for adding angular momentum are 
independent of any statement about the Hamiltonian, and therefore carry 
no implication about the time evolution of the system. The gyros may or 
may not precess, depending on whether they are dynamically coupled. 
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In §6.1 we saw that the physical significance of the state of a composite 
system, such as that formed by two gyros, being a linear combination of 
product states such as | ji, uii)|j 2 , mi) is that the subsystems are correlated. 
The Clebsch-Gordan coefficients encode the correlations between the gyros 
required for the box to have well-defined angular momentum. If there is any 
uncertainty in the orientation of either gyro, such correlations are essential 
if the angular momentum of the box is to be well defined: the angular mo¬ 
mentum of the second gyro has to make up a pre-defined total with whatever 
value is measured for the first gyro. This consideration explains why the only 
states of the box that are simple products of states of the individual gyros are 
|ji +j 2 ,ji +ji) = \ji,ji)\j 2 ,j 2 ) and |ji +j 2 ,~(ji +j 2 )) = \ji,-ji)\j 2 ,-j 2 ) 
- so much angular momentum can be aligned with the 2 -axis only by each 
gyro individually straining to align with the axis, and there is then no need 
for the gyros to coordinate their efforts. 


7.5.1 Case of two spin-half systems 

The general analysis we have just given will be clarified by working out some 
particular cases. We consider first the case j\ = j% = ^, which is relevant, for 
example, to a hydrogen atom in its ground state, when all angular momentum 
is contributed by the spins of the proton and the electron. The electron has 
base states |±,e) in which J z returns the value ±|, while the proton has 
corresponding base states |±,p). Hence there are four states in all and j 
takes just two values, 1 and 0. 

Our construction of the states in which the atom has well-defined angular 
momentum starts with the state 

|l,l) = |+,e)|+,p) (7.153) 

in which both the electron and the proton have their spins maximally aligned 
with the 2 axis. So the atom has maximum angular momentum, and its 
angular momentum is maximally aligned with the 2 axis. Applying J_ = 
Jf. + j£ to this ket we obtain 

M> = ^2 (l~> e )l+>P) + e )I >p)) • (7.154) 

The right side of this equation states that with the atom in this state, mea¬ 
surements of J z for the electron and proton are certain to find that they 
are ‘antiparallel’. This fact is surprising given that the left side states that 
the atom has maximum angular momentum, so you would think that the 
two particles had parallel angular momenta. The resolution of this paradox 
is that the 2 components of the two spins are antiparallel, but the compo¬ 
nents in the xy plane are parallel, although their direction is unknown to 
us. Similarly, when the atom is in the state 11,1) of equation (7.153), the 
2 components of the electron and proton angular momenta are parallel, but 
the components in the xy plane are not well aligned. The poor alignment in 
the xy plane explains why yff* = y/2 for the atom is less than ^3, which is 
the sum of \TP = i/3/4 for the electron and the proton. 

When we apply J_ to |1, 0) we obtain 

|l,-l) = |-,e)|-,p). (7.155) 

This equation confirms the physically obvious fact that if we want to have 
h of angular momentum pointing along the negative 2 axis, we need to have 
the angular momenta of both the proton and the electron maximally aligned 
with the negative 2 axis. 

The remaining state of the atom is |0, 0) in which the atom has no angu¬ 
lar momentum. This is the unique linear combination of the two compound 
states on the right of equation (7.154) that is orthogonal to |1, 0): 

l°,°} = ^(|-.e>|+,p)-|+,e)|-,p)). 


(7.156) 
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The change of sign on the right of this equation from the right of equation 
(7.154) for 11, 0) ensures that the spins of the electron and proton are antipar¬ 
allel in the xy plane as well as along the z axis. We show this by rewriting 
11,0) and |0, 0) in terms of the states in which the electron and proton have 
well-defined spin parallel to the rr-axis. These states are 


k+i 6 ) — ^2 (I++) + l — > e )) 
k+iP) = ^ (l+,p> + Kp» 


k-, e ) = ^2 (| +, e )- | -, e ) ) 

k-» p ) = (l+,P) - |-,P» 


(7.157) 


So 


|0,0) = |®+, e)(ai+, e|0,0) + \x—, e){x—, e|0,0) 

= 5k+> e ) (-h)P) + l+>P» - (|-,P) + l+iP)) 

= (|x+,e)|x-,p) + |x-,e)|x+,p)). 


(7.158) 


The last line states that when the atom is in the state |0,0) we are indeed 
guaranteed to find the components of the spins of the electron and proton 
parallel to x have opposite signs. An analogous calculation starting from 
equation (7.154) yields (Problem 7.27) 

|1,0) = -^ (|aH-, e)|ai+, p) - \x~, e)|x—, p)), (7.159) 

so when the atom is in the |1, 0) state the two particles have identical com¬ 
ponents of spin along x . 

Notice that all three states in which the atom has j = 1 are unchanged 
if we swap the m values of the particles - that is, if we map |±,e) —> |=F, e) 
and the same for the proton states. The atomic atomic state with j = 0, 
by contrast, changes sign under this interchange. This fact will prove to be 
important when we consider systems with two electrons (such a helium) or 
two protons (such as an H2 molecule). 


7.5.2 Case of spin one and spin half 

In the first excited state of hydrogen, the electron can have total orbital 
angular momentum quantum number 1 = 1. So we now consider how to 
combine angular momentum j = 1 with the electron’s spin, j = \- The total 
angular momentum quantum number takes two values, j = | and j = \ (see 
Figure 7.12). We start with the state 

If,f> = l+)IM> (7+60) 

in which the spin and orbital angular momenta are both maximally oriented 
along the z axis. Applying J_ = L_ + SL to this equation, we obtain 

lf4> = V^|->l 1 > 1 > + -\/§l+>l 1 - 0 >- (7- 161 ) 

The right side of this equation says that in this state of the atom, the electron 
is twice as likely to be found with its spin up as down. A second application 
of J_ yields 

!§.-§> = v / ll->l 1 ’°) + \/Jl + >l 1 ’ _1 > (7 ' 162) 

as we would expect from the symmetry between up and down. A final 
application of J_ yields — |) = | —)11, — 1) as it must on physical grounds. 
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Figure 7.14 Classically the sum 
vector + J 2 can line anywhere on 
the sphere of radius \7J 2 \ around the 
end of 


The state |~, ^) is the linear combination of the states that appear in 
the right of equation (7.161) that is orthogonal to ||, ^). Hence, 

\bh) = \fi |-)lt l)-y/l |+)|1,0). (7.163) 

In this atomic state, the electron’s spin is twice as likely to be down as up. 
The last remaining state can be found by applying J_ to equation (7.163). 
It is _ __ 

\b-h) = VTl-)! 1 ’ 0 ) - ( 7 - 164 ) 


7.5.3 The classical limit 

In classical physics we identify angular momentum with the vector J = Ti (J), 
and the angular momentum of the whole system is obtained by vectorially 
adding the angular momenta J\ and J 2 of the component parts. If 9 is the 
angle between these vectors, then 

J 2 = Jl + + 2 J X J 2 cos 9. (7.165) 

If nothing is known about the direction of J 2 relative to 3\ , all points on a 
sphere of radius J 2 and centred on the end of J\ are equally likely locations 
for the end of J 2 (Figure 7.14). Consequently, the probability dP that 9 lies 
in the range (9, 9 + d 9) is proportional to the area of the band shown in the 
figure. Quantitatively 

dP=lsin0d0, (7.166) 

where the factor 1 ensures that f d P = 1. From equation (7.165) the change 
in J when 9 changes by d 9 is given by 

JdJ = ~J ± J 2 sin 9d9. (7.167) 

Combining equations (7.166) and (7.167), we find that the probability that 
the total angular momentum lies in the interval (J7, J + dj) is 10 

d P=bL^L. (7.168) 

2J1J2 


10 We discarded the minus sign in equation (7.167) because we require dP > 0 regardless 
of whether J increases or decreases. 
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In quantum mechanics the fraction of states that have total angular- 
momentum quantum number j is 


= 2j + l 

(2ji + l)(2j2 + 1) 


(7.169) 


which in the classical limit of large quantum numbers becomes approximately 
j/(2jij 2 )- If all states were equally likely, this fraction would equal the 
classical probability that J ~ jfi lay within fi of jfi. It is easy to check 
from (7.167) that d P does indeed take the value / when we insert Ji = hji 
and d J = Ti. Thus from consistency with classical mechanics we are led 
to the principle of equal a priori probability, namely that when we 
have no information relevant to an upcoming measurement, we assign equal 
probabilities to the system being in each state of whatever basis we have 
decided to work in. This principle is the foundation of all statistical physics. 


Problems 

7.1 Show that (j,j\J x \j,j) = {j,j\J y \j,j) = 0 and that {j, j\{J^+Jy)\j, j) = 
j . Discuss the implications of these results for the uncertainty in the orien¬ 
tation of the classical angular momentum vector J for both small and large 
values of j. 

7.2 In the rotation spectrum of 12 C 16 0 the line arising from the transition 
l = 4 —> 3 is at 461.04077 GHz, while that arising from l = 36 —> 35 is at 
4115.6055 GHz. Show from these data that in a non-rotating CO molecule 
the intra-nuclear distance is s ~ 0.113 nm, and that the electrons provide 
a spring between the nuclei that has force constant ~ 1904Nm -1 . Hence 
show that the vibrational frequency of CO should lie near 6.47 x 10 13 Hz 
(measured value is 6.43 x 10 13 Hz). Hint: show from classical mechanics 
that the distance of O from the centre of mass is |s and that the molecule’s 
moment of inertia is -^m p s 2 . Recall also the classical relation L = Iuj. 

7.3 Show that Li commutes with x • p and thus also with scalar functions 
of x and p. 

7.4 Write down the expression for the commutator [cx, ; , ay] of two Pauli 
matrices. Show that the anticommutator of two Pauli matrices is 


{ (jj . (jj | — 2 Sj j . 


(7.170) 


7.5 Let n be any unit vector and er = (<T x ,a y ,<T z ) be the vector whose 
components are the Pauli matrices. Why is it physically necessary that n • er 
satisfy (n • er) 2 = I, where I is the 2x2 identity matrix? Let m be a 
unit vector such that m • n = 0. Why do we require that the commutator 
[m • er, n • er] = 2i(m x n) • <r? Prove that that these relations follow from the 
algebraic properties of the Pauli matrices. You should be able to show that 
[m • er, n • er] = 2i(m x n) • er for any two vectors n and m. 

7.6 Let n be the unit vector in the direction with polar coordinates (0, </>). 
Write down the matrix n • er and find its eigenvectors. Hence show that the 
state of a spin-half particle in which a measurement of the component of spin 
along n is certain to yield i h is 

|+, n) = sin(0/2) e 1 ^ 2 ! —) + cos(0/2) e -I< ^ 2 |+), (7.171) 

where |±) are the states in which is obtained when s z is measured. 
Obtain the corresponding expression for | —, n). Explain physically why the 
amplitudes in (7.171) have modulus 2 -1 / 2 when 0 = 7 t /2 and why one of the 
amplitudes vanishes when 6 = tt. 
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7.7 For a spin-half particle at rest, the rotation operator J is equal to the 
spin operator S. Use the result of Problem 7.4 to show that in this case the 
rotation operator U{a) = exp(—ict • J) is 

U(a) = I cos ^ — id: • crsin ^ , (7.172) 

where d is the unit vector parallel to a. Comment on the value this gives 
for U(a) when a = 2 tt. 

7.8 Write down the 3x3 matrix that represents S x for a spin-one system 
in the basis in which S z is diagonal (i.e., the basis states are |0) and |±) with 
<Sz|+) = |+)) etc.) 

A beam of spin-one particles emerges from an oven and enters a Stern- 
Gerlach filter that passes only particles with J z = h. On exiting this filter, 
the beam enters a second filter that passes only particles with J x = h , and 
then finally it encounters a filter that passes only particles with J z = —h. 
What fraction of the particles stagger right through? 

7.9* Repeat the analysis of Problem 7.8 for spin-one particles coming on 
filters aligned successively along +z, 45° from z towards x [i.e. along (1,0,1)], 
and along x. 

Use classical electromagnetic theory to determine the outcome in the 
case that the spin-one particles were photons and the filters were polaroid. 
Why do you get a different answer? 

7.10 A system that has spin momentum yjbh, is rotated through an angle <j> 
around the 2 axis. Write down the 5x5 matrix that updates the amplitudes 
a m that S z will take the value m. 

7.11 Justify physically the claim that the Hamiltonian of a particle that 
precesses in a magnetic field B can be written 

H = —2/is • B. (7.173) 

In a coordinate system oriented such that the 2 axis is parallel to B, a 
proton is initially in the eigenstate |+,a;) of s x . Obtain expressions for the 
expectation values of s x and s y at later times. Explain the physical content 
of your expressions. 

Bearing in mind that a rotating magnetic field must be a source of 
radiation, do you expect your expressions to remain valid to arbitrarily late 
times? What really happens in the long run? 

7.12 Show that a classical top with spin angular momentum S which is 
subject to a torque G = //S x B/| S| precesses at angular velocity u) = /iB/1S|. 
Explain the relevance of this calculation to magnetic resonance imaging in 
general and equation (7.137b) in particular. 

7.13* Write a computer programme that determines the amplitudes a m in 


S 

|n; s, s) = ^2 a m \s,m) 

m=—s 


where n = (sin#,0, cos#) with # any angle and |n; s, s) is the ket that solves 
the equation (n • S)|n; s, s) = s|n; s, s ). Explain physically the nature of this 
state. 

Use your a m to evaluate the expectation values (S x ) and (S^) for this 
state and hence show that the RMS fluctuation in measurements of S x will 
be \Jsj2 cos#. 
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7 . 14 * We have that 


L + = L x + \Ly = e 10 (J^ + icot 6-^. ( 7 . 174 ) 

From the Hermitian nature of L z = —ii d/d<f> we infer that derivative operators 
are anti-Hermitian. So using the rule (ABy = B^A^ on equation ( 7 . 174 ), 
we infer that 

This argument and the result it leads to is wrong. Obtain the correct result 
by integrating by parts f A9 sind f A<p ( f*L + g ), where / and g are arbitrary 
functions of 9 and <f>. What is the fallacy in the given argument? 

7.15* By writing Ti 2 L 2 = (x x p) • (x x p) = Y^ijklm e ijkXjPk eumXiPm show 
that 

P 2 = ^^ + ^{( r 'P) 2 - iSr 'P}- ( 7 - 175 ) 

By showing that p • r — r • p = —2ih/r, obtain r • p = rp r + i Ti. Hence obtain 


2 2 
P =Pr 


h 2 L 2 


(7.176) 


Give a physical interpretation of one over 2m times this equation. 

7.16 The angular part of a system’s wavefunction is 


(9, (j)\ijj) (x (y/2 cos 9 + sinde 1<?i — sin 9e'^). 


What are the possible results of measurement of (a) L 2 , and (b) L z , and 
their probabilities? What is the expectation value of L Z 1 

7.17 A system’s wavefunction is proportional to sin 2 9e 2u ^. What are the 
possible results of measurements of (a) L z and (b) L 2 ? 

7.18 A system’s wavefunction is proportional to sin 2 9. What are the pos¬ 
sible results of measurements of (a) L z and (b) A 2 ? Give the probabilities of 
each possible outcome. 

7.19 Consider a stationary state \E, l} of a free particle of mass m that has 
angular-momentum quantum number l. Show that Hi\E, l) = E\E, l ), where 




l(l + l)h 2 \ 

r 2 ) ' 


(7.177) 


Give a physical interpretation of the two terms in the big bracket. Show that 
Hi = AjAi, where 


Ai = 






(7.178) 


Show that [Aj, Aj] = Hi + \ — Hi. What is the state Ai\E,l)? Show that for 
E > 0 there is no upper bound on the angular momentum. Interpret this 
result physically. 

7 . 20 * Show that [. J z ,Lj] = i an d [Ji, L 2 ] = 0 by eliminating L z 

using its definition L = T*x x p, and then using the commutators of 
with x and p. 
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7.21* In this problem you show that many matrix elements of the position 
operator x vanish when states of well defined l,m are used as basis states. 
These results will lead to selection rules for electric dipole radiation. First 
show that [L 2 ,Xi] = i Cji k (LjX k + x k Lj). Then show that L • x = 0 and 
using this result derive 

[L 2 ,[L 2 ,Xi}\ =i^2e jik (L j [L 2 ,x k ] + [L 2 ,x k ]L j ) = 2(L 2 Xi + XiL 2 ). (7.179) 

jk 


By squeezing this equation between angular-momentum eigenstates ( l,m\ 
and | l',m') show that 


0 = {(/3 — /3') 2 - 2(/3 + p')}(l,m\xi\l',m'), 


where f} = 1(1 + 1) and f}' = l’(l' + 1). By equating the factor in front of 
(l, m\xi\l', m 1 ) to zero, and treating the resulting equation as a quadratic 
equation for j3 given /?', show that (l, m\xi\l', m!) must vanish unless l + l' = 
0 or l = l' ± 1. Explain why the matrix element must also vanish when 
1 = 1 ' = 0 . 


7.22* Show that l excitations can be divided amongst the x, y or z oscilla¬ 
tors of a three-dimensional harmonic oscillator in (^l + 1)(Z +1) ways. Verify 
in the case l = 4 that this agrees with the number of states of well defined 
angular momentum and the given energy. 


7.23* Let 


Ai = 


y/2mhix 


(/ +1 )h 

i p r -b moor 


(7.180) 


be the ladder operator of the three-dimensional harmonic oscillator and | E, l) 
be the oscillator’s stationary state of energy E and angular-momentum quan¬ 
tum number l. Show that if we write Ai\E,l) = a-\E — huj,l + 1), then 
a_ = \J £ — l, where £ is the angular-momentum quantum number of a cir¬ 
cular orbit of energy E. Show similarly that if \ E,l) = a + \E + Tiuj , l — 1 ), 
then a+ = \/£ — l + 2. 


7.24* Show that the probability distribution in radius of a particle that 
orbits in the three-dimensional harmonic-oscillator potential on a circular 
orbit with angular-momentum quantum number l peaks at r/£ = \/2{l + 1), 
where 


£ = 



(7.181) 


Derive the corresponding classical result. 


7.25* A particle moves in the three-dimensional harmonic oscillator poten¬ 
tial with the second largest angular-momentum quantum number possible at 
its energy. Show that the radial wavefunction is 


^' x — ^ e x / 4 where x = r / 1 with 


U\ (X x \ x — 


How many radial nodes does this wavefunction have? 



(7.182) 


7.26 A box containing two spin-one gyros A and B is found to have angular- 
momentum quantum numbers j = 2, m = 1. Determine the probabilities 
that when J z is measured for gyro A, the values m = ±1 and 0 will be 
obtained. 

What is the value of the Clebsch-Gordan coefficient C(2,1; 1,1,1, 0)? 
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7.27 The angular momentum of a hydrogen atom in its ground state is 
entirely due to the spins of the electron and proton. The atom is in the state 
11,0) in which it has one unit of angular momentum but none of it is parallel 
to the z-axis. Express this state as a linear combination of products of the 
spin states |±, e) and |±, p) of the proton and electron. Show that the states 
|x±,e) in which the electron has well-defined spin along the a;-axis are 

k±. e > = -^2 d+’ e ) ± K e »- (7.183) 


By writing 


11,0) = |aH-,e)(aH-,e|l,0) + \x-, e){x-, e|l, 0), (7.184) 


express |1, 0) as a linear combination of the products |x±, e)|a:±. p). Explain 
the physical significance of your result. 

7 . 28 * The interaction between neighbouring spin-half atoms in a crystal is 
described by the Hamiltonian 


H = K 


/ SW -S( 2 ) _ (SW -a)(S( 2 ) -a) \ 
\ a a 3 J 


(7.185) 


where K is a constant, a is the separation of the atoms and is the first 
atom’s spin operator. Explain what physical idea underlies this form of H. 
Show that si 2) + S'y 1 ' 1 Sy 2) = \ (S 1 ^ S^ ). Show that the mutual 

eigenkets of the total spin operators S 2 and S z are also eigenstates of H and 
find the corresponding eigenvalues. 

At time t = 0 particle 1 has its spin parallel to a, while the other 
particle’s spin is antiparallel to a. Find the time required for both spins to 
reverse their orientations. 
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Hydrogen 


Wherever we look, down at ourselves or up into the vastness of the Uni¬ 
verse, what we see are atoms. The way atoms interact with themselves and 
with electromagnetic radiation structures the world about us, giving colour, 
texture, solidity or fluidity to all things, both alive and inanimate. In the 
wider Universe the way visible matter has aggregated into stars and galaxies 
is determined by the interplay between atoms and radiation. In the last two 
decades of the twentieth century it emerged that atoms do not really domi¬ 
nate the Universe; on large scales they are rather like icing on the cake. But 
they certainly dominate planet Earth, and, like the icing, they are all we can 
see of the cosmic cake. 

Besides the inherent interest of atomic structure, there is the histori¬ 
cal fact that the formative years of quantum mechanics were dominated by 
experimental investigations of atomic structure. Most of the principles of 
the subject were developed to explain atomic phenomena, and the stature of 
these phenomena in the minds of physicists was greatly enhanced through 
the role they played in revolutionising physics. 

It is an unfortunate fact that atoms are complex systems that are not 
easily modelled to a precision as good as that with which they are commonly 
measured. The complexity of an atom increases with the number of electrons 
that it contains, both because the electrons interact with one another as well 
as with the nucleus, and because the more electrons there are, the higher 
the nuclear charge and the faster electrons can move. By the middle of the 
periodic table the speeds of the fastest electrons are approaching the speed 
of light and relativistic effects are important. 

In this chapter we develop a model of the simplest atom, hydrogen, 
that accounts for most, but not all, measurements. In Chapter 10 we will 
take the first steps towards a model of the second most complex atom, he¬ 
lium, and indicate general trends in atomic properties as one proceeds down 
the periodic table. The ideas we use will depend heavily on the model of 
hydrogen-like systems that is developed in this chapter. With these appli¬ 
cations in view, we generalise from hydrogen to a hydrogen-like ion, in 
which a single electron is bound to a nucleus of charge Ze. 
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Chapter 8: Hydrogen 


8.1 Gross structure of hydrogen 

We start with a rather crude model of a hydrogen-like ion. In this model 
neither the electron nor the nucleus has a spin, and the electron moves non- 
relativistically under purely electrostatic forces. The structure of an atom or 
ion that is obtained using these approximations is called its gross structure. 
The approximations make it easy to write down the model Hamiltonian be¬ 
cause they include just three contributions to the energy: the kinetic energies 
of the nucleus and the electron, and the bodies’ electrostatic binding energy: 


H = 


+ 


Ze 2 


2?n n 2 m e 47reo|x e — : 


( 8 . 1 ) 


where x e and x n are the position operators of the electron and the nucleus, 
respectively, and p e and p n are the corresponding momentum operators. We 
wish to solve the eigenvalue equation H\E) = E\E) for this Hamiltonian. 
In the position representation, the momentum operators become derivative 
operators, and the eigenvalue equation becomes a partial differential equation 
in six variables 


^(x n ,Xe) = 


Ti 2 2 fi 2 r, Ze 2, ip 

7 -V 2 V» - ~-V e 2 ^ - -j-7, 

2 m B 2 m e 4 tt£ 0 x e - x n 


( 8 . 2 ) 


where a subscript e or n on V implies the use of derivatives with respect to the 
components of x e or x n . Remarkably, we can solve this frightening equation 
exactly. The key step is to introduce six new variables, the components of 


X = 


m e x e + m n x n 


m e + m n 


and r = x e — x n . 


(8.3) 


X is the location of the ion’s centre of mass, and r is the vector from the 
nucleus to the electron. The chain rule yields 


d dX d dr d m e d d 

9x e dx e <9X <9x e dr m e + m n dX dr 

When we take the dot product of each side with itself, we find 


V 


2 

e 


( Vy2 . V 2 2 m e d 2 

\ m e + m n ) x r m e + m n dX ■ dr ’ 


(8.4) 


(8.5a) 


where the subscripts X and r imply that the operator is to be made up of 
derivatives with respect to the components of X or r. Similarly 


V 


2 

n 


m n 


m n 


x 


2 m n d 2 
m e + m n dX-dr' 


(8.5b) 


We now add m e 1 times equation (8.5a) to m n 1 times equation (8.5b). The 
mixed derivatives cancel leaving 


m, 


-U 


m 


-l 


Vr = 


-Vi 


m e 


-V 2 

v r? 


(8.6a) 


where 


m. e m n 
m e + m n 


(8.6b) 


is called the reduced mass of the electron. In the case of hydrogen, when 
m n = to p = 1836m e , the reduced mass differs very little from m e (p, = 
0.99945??7. e ), and in heavier hydrogen-like ions the value of /./, lies even closer 
to m e . 
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Figure 8.1 The effective potential 
(8.13) for Z = 1 and (from bottom 
to top) l = 0,1, 2, 3, 4. 


When we use equation (8.6a) to replace x e and x n in equation (8.2) by 
r and X, we obtain 


Eip 


n 2 

2 (m e + m„) 





Ze 2 

4 ^ 60 ^ 


V’- 


(8.7) 


The right side breaks into two parts: the first term is the Hamiltonian Hk of 
a free particle of mass m e + m n , while the second and third terms make up 
the Hamiltonian H r of a particle of mass p that is attracted to the origin by 
an inverse-square law force. Since Hk and H r commute with one another, 
there is a complete set of mutual eigenkets. In §6.1.2 we showed (page 109) 
that in these circumstances we can assume that ip is a product 


^(x e ,x n ) = K(X)ip r (r), (8.8) 


where 


and 


n 2 

2 (m e + m n ) 


vy< = e k i< 


(8.9) 



Ze 2 ip r 
47 reo r 


E r ip r . 


( 8 . 10 ) 


Here Ek and E r are two distinct eigenvalues and their sum is the ion’s total 
energy, E = E K + E r . 

From §2.3.3 we know all about the dynamics of a free particle, so equa¬ 
tion (8.9) need not detain us. We have to solve equation (8.10). In the 
interests of simplicity we henceforth omit the subscript E r . 

Equation (7.69) enables us to write the kinetic energy term in equation 
(8.10) in terms of the radial momentum operator p r and the total orbital 
angular momentum operator L 2 . Equation (8.10) is then the eigenvalue 
equation of the Hamiltonian 



h 2 L 2 
2 pr 2 


Ze 2 

47reor 


( 8 . 11 ) 


L 2 commutes with H since the only occurrence in H of the angles 0 and <p 
is in L 2 itself. So there is a complete set of mutual eigenstates | E,l,m) of 
H, L 2 and L z such that L 2 \E, l,m) = 1(1 + 1)| E,l,m). For these kets the 
operator H of equation (8.11) is equivalent to the radial Hamiltonian 



l{l + 1 )h 2 

2 pr 2 


Ze 2 


47reo r 


( 8 . 12 ) 
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The operator (8.12) is the Hamiltonian for a particle that moves in one 
dimension in the effective potential 


V'eff(r) 


1(1 + 1 )h 2 Ze 2 

2 pr 2 47reor 


(8.13) 


The first term in this expression is the kinetic energy that conservation of 
angular momentum requires in tangential motion, while the second term is 
the electrostatic potential energy. V e g is plotted in Figure 8.1 for l = 0,..., 4. 
The radial Hamiltonian Hi governs the oscillations of the reduced mass 
around the minimum for V e g. By astute exploitation of natural coordinates 
and symmetry we have reduced our original intimidating Hamiltonian (8.1), 
which contained twelve operators, to a Hamiltonian Hi that contains only 
two operators. The eigenkets of the Hamiltonian H r for the internal struc¬ 
ture of the ion are products of the eigenkets | E,l) of Hi and eigenkets \l,m) 
of L 2 and L z : 

\E,l,m) = \E,l)\l,m). (8-14) 

Hi is strikingly similar to the radial Hamiltonian defined by equation 
(7.81) for which we solved the eigenvalue problem in the course of our study 
of the three-dimensional harmonic oscillator. We use essentially the same 
technique now, defining the dimensionless ladder operator 


a _ °o (i _ l +1 Z \ 

1 ~ \/2 \ fr Pr r (l + l)o 0 ) 


where we have identified the Bohr radius 1 


a 0 = 


47reo?i 2 

pe 2 


(8.15a) 


(8.15b) 


The product of Ai with Ad is 



2 



Z 

(l + l)«o 
Z 

(l + l)ao 


l + 1 \ f i Z 

r ) \h Pr + (l + l)a 0 


l + l 

r 



l + l 


(8.16) 


Equations (2.25) and (7.67) enable us to evaluate the commutator in this 
expression, so we have 


AjAi = ^(4 
1 2 U 2 


On 


a 2 0 p 


Pr 


Hi + 


(1 + l ) 2 


(l + D 2 a 2 0 
l{l + 1) 2Z 

r 2 aor 
Z 2 


2 (Z + 1 ) 2 ' 


-.. i l + 1, 

- — -¥-S-tPr» r . 


2 z 

a 0 r 


(l + l) 2 a 2 0 


(8.17) 


If we evaluated the product AiAl^ the sign in front of the commutator in the 
first line of equation (8.17) would be reversed, so we would find 




n 2 Hl+1 2(1 + 1 ) 2 ' 


(8.18) 


1 The physical significance of ao is clarified by rewriting equation (8.15b) in the form 
e 2 /(47reoao) = (fi/ao ) 2 /^- The left side is the electrostatic potential energy at ao and 
the right side is twice the kinetic energy of zero-point motion (§3.1) of a particle whose 
position has uncertainty ~ ao- For hydrogen ao = 5.29177 X 10 —11 m. 
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Taking equation (8.17) from equation (8.18) we obtain the commutator 

\Ai,A\) = C ^(H l+1 -H l ), (8.19) 

a result that recalls equation (7.88) for the three-dimensional harmonic os¬ 
cillator. 

It is useful to rewrite equation (8.17) in the form 


Hi = 


V>al 


A] A, - 


2(1+ iy 


( 8 . 20 ) 


Commuting each side of this equation with Ai and using equation (8.19), we 
obtain an expression for the commutator of Hi with Af. 

[A u Hi\ = ^[AuAlAt] = A l ,A l \]At = (H l+1 - H t )Ai. (8.21) 

t ia o 

This equation simplifies to 


AiHi = H i+x Ai. 


( 8 . 22 ) 


We show that Ai is a ladder operator by multiplying it into both sides 
of the eigenvalue equation Hi\E,l) = E\E,l) and using equation (8.22): 

EAi\E,l) = AiH t \E,l) = H l+1 Ai\ E,l). (8.23) 


This equation states that Ai\E,l) is an eigenket of Hi + 1 with eigenvalue E. 
That is, Ai transfers energy from the electron’s radial motion to its tangential 
motion. If we repeat this process by multiplying Ai\E, l) by Aj+i, and so on, 
we will eventually arrive at a circular orbit. Let C(E) denote the l value of 
this orbit. Then Ac, must annihilate | E,C) because, if it did not, we would 
have a state with even greater angular momentum. Thus with equation 
(8.17) we can write 


0=\A c \E,C)\ 2 


(E,£\AlA c \EX) = ^E + 


2(C + l) 2 


(8.24) 


That is, 

E _ Z 2 Ti 2 _ Z 2 e 2 _ 

2yia^n 2 8neoaon 2 2n 2 ( , l'Keo‘h) 2 ' 


(8.25) 


where we have defined the principal quantum number n = C + 1 and the 
second equality uses the definition (8.15b) of the Bohr radius. The Rydberg 
constant 1Z is 


n = 


n 2 

2 /ra 2 


e 2 i / e 2 \ 2 

87reoao 2 ^ \47reo?i / 


13.6056923 eY, 


(8.26) 


where p = m e m p / (m e +m p ) is the reduced mass in the case of hydrogen. The 
Rydberg constant enables us to give a compact expression for the permitted 
values of E and l in hydrogen 


E = - p (n = 1, 2,...) ; 0<Z<n-l. (8.27) 

n z 

Henceforth we use n rather than E to label kets and wavefunctions. Thus 
| n,l,m) = \n,l)\l,m) (cf. eq. 8.14) is the stationary state of a hydrogen-like 
ion for the energy given by (8.25) and the stated angular-momentum quan¬ 
tum numbers. The ground state is 11, 0,0). The energy level immediately 
above the ground state is four-fold degenerate, being spanned by the states 
|2, 0,0), |2,1,0) and |2,1,±1). The second excited energy level is 9-fold de¬ 
generate, and so on. 

This property of our model hydrogen atom, that it has states with dif¬ 
ferent l but the same energy, is unusual and reflects a hidden symmetry of 
our model - see Appendix F for details. Atoms with more than one electron 
have energy levels that depend explicitly on l even when spin and relativity 
are neglected. When our model of hydrogen is upgraded to include spin and 
relativity, E becomes weakly dependent on l. 
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Figure 8.2 Schematic diagram of 
the Lyman, Baimer and Paschen se¬ 
ries of spectral lines in the spectrum 
of hydrogen. 


8.1.1 Emission-line spectra 

A hydrogen atom may change its value of n to a smaller value n', releasing 
the liberated energy as a photon of frequency v = (E n — E n f)/h. Hence the 
emission spectrum of hydrogen contains lines at the frequencies 



The lines associated with a given lower level n' form a series of lines of 
increasing frequency and decreasing wavelength. The series associated with 
n' = 1 is called the Lyman series, the longest-wavelength member of which 
is the Lyman a line at 121.5nm, followed by the Ly/3 line at 102.5 nm, 
and so on up to the series limit at 91.2 nm. The series associated with 
n' = 2 is called the Balmer series and starts with a line called Ha at 

656.2 nm and continues with H/3 at 486.1 nm towards the series limit at 
364.6 nm. The series associated with n' = 3 is the Paschen series, and 
that associated with n' = 4 is the Brackett series. Figure 8.2 shows the 
first three series schematically. Historically the discovery in 1885 by Johann 
Balmer (1825-1898), a Swiss schoolmaster, that the principal lines in the 
optical spectrum of hydrogen could be fitted by equation (8.28), was crucial 
for the development of Niels Bohr’s model atom of 1913, which was the 
precursor of the current quantum theory (Problem 8.3). 

Equation (8.25) states that, for given n, the energy of an electron scales 
as Z 2 . For a many-electron atom electromagnetic interactions between the 
electrons invalidate this scaling. However, it holds to a fair approximation 
for electrons that have the smallest values of n because these electrons are 
trapped in the immediate vicinity of the nucleus and their dynamics is largely 
unaffected by the presence of electrons at larger radii. Henry Moseley (1887 
1915) studied the frequencies of X-rays given off when atoms were bombarded 
by free electrons. He showed 2 that the frequencies of similar spectral lines 
from different elements seemed to scale with the square of the atomic number. 
At that time the periodic table was something constructed by chemists that 
lacked a solid physical foundation. In particular, the atomic numbers of some 
elements were incorrectly assigned. Moseley’s experiments led to the order 
of cobalt and nickel being reversed, and correctly predicted that elements 
with atomic numbers 43, 61, 72, 75, 87 and 91 would be discovered. 


8.1.2 Radial eigenfunctions 

The wavefunctions of lrydrogen-like ions are not only important for exper¬ 
iments with atoms and ions that have only one electron, but are also the 
building blocks from which models of many-electron atoms are built. 

The first step in finding any radial eigenfunction for a hydrogen-like ion 
is to write the equation A„_i|n, n— 1) = 0 (eq. 8.24) as a differential equation 
for the radial wavefunction of the circular orbit with angular-momentum 

2 Moseley, H.G.J., 1913, Phil. Mag., 27, 703. The lines studied by Moseley were 
associated with transitions n = 2 —> 1. 
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Figure 8.3 The radial wavefunctions u'/, 1 (r) of “circular” orbits for n = 1,2 and 3. 


quantum number l = n — 1. From equations (8.15a) and (7.66) we need to 
solve 

IX" 1 +(-— + —) < _1 = 0, (8.29) 

or \ r nao J 

where 

u n( r ) = {r\n,l). (8.30) 

Equation (8.29) is a first-order linear differential equation. Its integrating 
factor is 

exp jy dr (^~~~ + | = r- (n " 1) e Zr/no °, (8.31) 

so the required eigenfunction is 

<“!(r) = Cr n ~ i e - Zr/nao , (8.32) 


where C is a normalising constant. This wavefunction is very similar to 
our expression (7.97) for the wavefunction of a circular orbit in the three- 
dimensional harmonic oscillator potential - the only difference is that the 
Gaussian function has been replaced by a simple exponential. The scale- 
length in the exponential is ( [n/Z)ao , so it increases with energy and decreases 
with the nuclear charge. This makes perfect sense physically because it states 
that more energetic electrons can go further from a given nucleus, and that 
nuclei with higher electric charge will bind their (innermost) electrons more 
tightly. 

We choose the normalising constant C in equation (8.32) to ensure that 
the complete wavefunction (eq. 8.14) 

(r,fl,0|(|f?,Z>|Z,m)) = <r|n,/)(fl,0|i,m)=«i,(r)Yr(0,0) (8.33) 

is correctly normalised. Bearing in mind that d 3 x = r 2 drd 2 fi and that 
f d 2 fl |Y["| 2 = 1, we find that C must satisfy 


1 = C 2 


d r r 2n e ~ 2 Zr/na 0 



= c 2 


/na,Q\ 2n + 1 

\ 2 z) 


(2 n)\, 


(8.34) 


where we have evaluated the integral with the aid of Box 8.1. The correctly 
normalised radial wavefunction is therefore 


L (r) = 


1 


\J (2n)! \na 0 J 


3/2 


(-) 

V nao J 


n— 1 


a — Zr/na 0 


(8.35) 


These functions are plotted for n = 1 — 3 in Figure 8.3. For n > 1 the 
wavefunction rises from zero at the origin to a peak at r = n(n— l)ao/Z and 
from there falls exponentially with increasing r. 
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Figure 8.4 The probability of find¬ 
ing the electron of a hydrogen atom 
that is in its ground state at a ra¬ 
dius greater than r. Radii greater 
than 2ao/Z are classically forbidden. 


Box 8.1: The factorial function 

We often encounter the integral T(a + 1) = / 0 °° dff“e _t . Integrating by 
parts we find that 

/»oo 

T(a + 1 ) = — [f“e _t ]^° + a / dff a_ 1 e _t 

Jo 

= aY(a). 

It is easy to check that T(l) = 1. Putting a = 1 in the last equation it 
follows that T(2) = 1. Setting a = 2 we find P(3) = 2, and repeating 
this process we see that for any integer n, T(n +1) = n!. We can use this 
result to define the factorial function by 

/»oo 

z\ = r(3 + l) = / df t z e“*. (1) 

Jo 

This definition yields a well defined value for z! for any complex number 
that is not a negative integer, and it coincides with the usual definition 
of a factorial if 2 happens to be a non-negative integer. 


We obtain the ground-state radial wavefunction by setting n = 1 in 
equation (8.35): 


/ 7 \ 3/2 

u°i(r)= 2 — e~ Zr/a °. (8.36) 

\ a o/ 


The complete wavefunction is obtained by multiplying this by Y[( = (47 t) - 1 / 2 . 
Figure 8.4 shows the probability of finding the electron at a radius greater 
than r. This reaches 13/e 4 ~ 0.24 at r = 2a$/Z , where the potential energy 
is equal to the total energy. In classical physics the probability of finding the 
electron at these radii is zero. 

It is interesting to calculate the expectation value of r for circular orbits. 
We have 


/ , ,, , , 1 {2Z\ 3 f°° J 3 (2Zr 

{n, n — 1, to r n, n — 1, m) = - — —I - / d?’r - 

(2n)! \naoJ J 0 \na 0 


2(n—1) 

— 2Zr/na 0 


_na 0 1 r A . 

2Z(2n)\J 0 CPP 


2n+l e ~p _ 


= n{n+\) C ^. 


(8.37) 

In the classical limit of large n, (r) ~ n 2 a 0 /Z , so FI oc 1/n 2 oc 1/ (r) as 
classical physics predicts. (One can easily show that classical physics yields 
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Figure 8.5 Probability densities for 
three orbits in hydrogen. All orbits have 
n = 3 and m = l. Clockwise from top 
left l increases from 0 to 2. The grey 
scale shows the log to base 10 of the 
probability density. 


the correct proportionality constant.) A very similar calculation shows that 

(r 2 ) = n 2 {n + 1 )(n + \)a 2 0 /Z 2 = (r) 2 , (8.38) 

n + 5 

so the rms uncertainty in r is yj ( r 2 ) — (r) 2 = (r) j\J2n + 1. Consequently, 

as n increases, the uncertainty in r increases as n 3 / 2 , but the fractional 
uncertainty in r decreases as vT 1 ^ 2 . 

Our conclusion that the radius of an atom scales as n 2 implies that an 
atom with n ~ 100 occupies 10 12 times as much volume as an atom in the 
ground state. Consequently, only at high-vacuum densities can such highly 
excited atoms be considered isolated systems. Radio telescopes detect line 
radiation emitted by hydrogen atoms in the interstellar medium that are 
reducing their value of n by 5n from n — 100. The frequency of such a 
transition is 


E n +Sn E n 


~6.58( —I 6n GHz. (8.39) 


Our analysis of the three-dimensional harmonic oscillator suggests that 
applications of A\, to should generate the wavefunctions u l n for l < n— 1. 
We show that this is indeed the case by daggering both sides of equation 
(8.22) to obtain 

HiA\=A\H i+1 . (8.40) 

Consequently, applying A] to both sides of E\E,l + 1) = Hi + i\E,l + 1) we 
have 

E(A]\E, 1+ 1) = A\H l+1 \E, 1 + 1) = H t (A\\E, l + 1)) 

which establishes that Aj\E,l + 1) is an eigenket of Hi as we hoped. Using 
a result proved in Problem get.AAdaggerprob , we have, in fact, that 


\n,l) 


V2 ( 1 
z U + 1) 


1 \ _1/2 

—) A\\n,l + l). 


(8.41) 




190 


Chapter 8: Hydrogen 


Table 8.1 The first six radial eigenfunctions u l n (r) for hydrogen with 
az = ao/Z. The full wavefunction is u l n (r)Y] n (9, <j>). 


I 


0 


1 


2 e" r / az 



1 


2 


n 

2 


3 


2e —r/2a z ^ r 

(2a z )3/ 2 V 1 " 2^ 


e ~r/2az r 

v / 3(2az) 3 / 2 


2e~ r / 3az ( 2r 2r 2 \ 

(3az) 3/2 \ 3a 2 + 27a l / 

25/2 e -r/3 az r / r \ 

9(3az) 3 / 2 az \ 6az / 

2 3/2 e -r/3a z / r \ 2 

v /27 v / 5(3az) 3 / 2 W/ 


From equations (8.15a) and (7.66) we can write 


_ __£o_ / l + 2 _ 

* yj2 \dr r (l + l)a 0 J ' 


Setting l = n — 2 we can apply this operator to u™ 1 to obtain 


n —2 


(r) = constant x ( 1 — 


Zr 


n(n — 1 )oq 


j.n-2 e ~Zr/na 0 


(8.42) 


(8.43) 


This wavefunction has a node at r = n{n— \)clq/Z. When we apply Al l _ 3 to 
this wavefunction to generate u" -3 , the lowest power of r in the factor that 
multiplies the exponential will be r” -3 , so the exponential will be multiplied 
by a quadratic in r and the wavefunction will have two nodes. In our study of 
the three-dimensional harmonic oscillator we encountered the same pattern: 
the number of nodes in the radial wavefunction increased by one every time 
decrements the angular momentum and increases the energy of radial 
motion. The radial eigenfunctions for states with n < 3 are listed in Table 8.1 
and plotted in Figure 8.5. 

Notice that because ■u” _1 (r) is a real function and A] is a real operator, 
all the radial eigenfunctions are real. Because the probability current J is 
proportional to the gradient of the phase of the wavefunction (eq. 2.87), the 
reality of u l n (r) implies that the probability current inside the atom has no 
radial component. This makes perfectly good sense physically: the electron 
moves both inwards and outwards and (unlike in the classical case) at any 
given point the electron is as likely to be moving out as in. 


8.1.3 Shielding 

The electrostatic potential in which a bound electron moves is never ex¬ 
actly proportional to 1/r as we have hitherto assumed. In hydrogen or 
a single-electron ion the deviations from 1/r proportionality are small but 
measurable. In many-electron systems the deviations are large. In all cases 
the deviations arise because the charge distribution that binds the electron 
is not confined to a point as we have assumed. First, protons and neu¬ 
trons have non-zero radii after all there has to be room for three quarks 
to move about in there at mildly relativistic speed! Second, even if the nu¬ 
clear charge were confined to a point, the field it generates would not be an 
inverse-square field because in the intense electric field that surrounds the 
nucleus, a non-negligible charge density arises in the vacuum. This charge 
density is predicted by quantum electrodynamics, the theory of the interac¬ 
tion of the Dirac field, whose excitations constitute electrons and positrons, 
and the electromagnetic field, whose excitations are photons. In a vacuum 
the zero-point motions (§3.1) of these fields cause electron-positron pairs to 
be constantly created, only to annihilate an extremely short time later. In 
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the strong field near the nucleus, the positrons tend to spend their brief 
lives further from the nucleus, which repels them, than do the electrons. In 
consequence the charge inside a sphere drawn around the nucleus is slightly 
smaller than the charge on the nucleus, the charge deficit being small for 
both very small and very large spheres. That is, quantum electrodynamics 
predicts that the vacuum is a polarisable dielectric medium, just like an or¬ 
dinary insulator, in which the electrons and ions move in opposite directions 
when a field is applied, giving rise to a net charge density within the medium. 

When an atom has more than one electron, the deviation of the elec¬ 
trostatic potential from 1/r proportionality is much larger than in hydrogen 
since the charge on any electron other than the one whose dynamics we are 
studying is distributed by quantum uncertainty through the space around 
the nucleus, so the charge inside a sphere around the nucleus is comparable 
to the charge on the nucleus when the sphere is very small, but falls to e 
when the sphere is large. 

Phenomena of this type, in which there is a tendency for a charged 
body to gather oppositely charged bodies around it, are often referred to as 
‘shielding’. A complete treatment of the action of shielding in even single¬ 
electron systems involves quantum field theory and is extremely complex. 
In this section we modify the results we have obtained so far to explore an 
idealised model of shielding, which makes it clear how shielding modifies the 
energy spectrum, and thus the dynamics of atomic species. 

The key idea is to replace the atomic number Z in the Hamiltonian with 
a decreasing function of radius. We adopt 

Z(r) = Z 0 (l + , (8.44) 

where Zq and a are adjustable parameters. For r a, the nuclear charge 
tends to a maximally shielded value Z$e. For r ~ a, the charge is larger 
by ~ Z^e. At very small r, the charge diverges, but we anticipate that 
this unphysical divergence will not have important consequences because the 
electron is very unlikely to be found at r <C clq/Z. With this choice for Z(r), 
the radial Hamiltonian (8.12) becomes 


= rf {1(1 + 1) - I3}h 2 _ Z 0 e 2 
1 2/i 2/ir 2 47reo r’ 


(8.45a) 


where 


P = 


Zoafre 2 

2-Keoh 2 


(8.45b) 


Because we chose to take the radial dependence of Z to be proportional to 
1/r, we have in the end simply reduced the repulsive centrifugal potential 
term in the radial Hamiltonian. Let l'(l) be the positive root of the quadratic 
equation 

l'(l'+ 1) =1(1 + 1)-j3. (8.46) 


In general V will not be an integer. With this definition, H( is identical with 
the Hamiltonian (8.12) of the unshielded case with V substituted for l and 
Zq replacing Z, that is 

= (8.47) 

Consequently, the operator Ay that is defined by equation (8.15a) with the 
same substitutions satisfies (cf. eq. 8.17) 


A), Ay = - 4 ^—Hy 


7 2 
Z 0 


2 ( 1 ' + 1 )' 


(8.48) 


Moreover by analogy with equation (8.19) we have 
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It follows that Ait is a ladder operator 

EA v \E,l') = {H V A V + [A^H^EJ') = H v+1 A v \E,l'), (8.50) 

so Ai'\E,l') = a\E,V + 1) is an (unnormalised) eigenket of Hy+i just as in 
the unshielded case. Applying Ai > + 1 to | E,l’ + 1) we argue that eventually 
some maximum value C! of V will be reached, at which point Ac\E, C) = 0. 
From the mod square of this equation we conclude that 

E = ~ 8,,wi- + ir where £ ' = i ' (i) + t ' ( 8 - 51 > 

where k is the number of times we have to apply A to achieve annihilation. 
Since for a ^ 0, V{1) is not an integer, E is given by the formula (8.25) 
for the unshielded case with n replaced by a number that is not an integer. 
Moreover, the energy now depends on l as well as on n, where n is defined 
to be l + l plus the number of nodes in the radial wavefunction at r < oo. 
To see this, consider the effect of increasing our initial value of l by one, and 
correspondingly decreasing by one the number of times we have to apply A;/ 
to achieve annihilation. In an unshielded atom l 1 = l, so E is unchanged 
when l is increased and k decreased by unity; we have moved between states 
with the same value of n. In the shielded case, increasing l by unity does 
not increment l'(l) by unity, so in equation (8.51) the changes in 1/ and k do 
not conspire to hold constant C. In fact one can show from equation (8.46) 
that when l increases by one, V increases by more than one (Problem 8.13), 
so among states with a given principal quantum number, those with the 
largest l values have the smallest binding energies. This makes perfect sense 
physically because it is the eccentric orbits that take the electron close to 
the nucleus, where the nuclear charge appears greatest. 

In 1947 Lamb & Retherford showed 3 that in hydrogen the state |2, 0,0) 
lies 4.4 x 10 _6 eV below the states |2, l,m), contrary to naive predictions 
from the Dirac equation. This Lamb shift is due to shielding of the proton 
by electron-positron pairs in the surrounding vacuum. 

8.1.4 Expectation values for r~ k 

It will prove expedient to have formulae for the expectation value of r~ k with 
hydrogenic wavefunctions and the first three values of k. 

The value of (r _1 ) can be obtained from the virial theorem (2.93) since 
in hydrogen the potential energy is oc r -1 . With a = —1, equation (2.93) 
implies that 

2(E\^-\E) = -(E\V\E). (8.52) 

On the other hand the expectation of the Hamiltonian yields 

(E\^\E) + (E\V\E)=E, (8.53) 

so we have 

Zp 2 Z 2 p 2 

< £ ' V 'l £ > = = 2E = - 4 ^++' < 8 ' 54) 

It follows that (r -1 ) = Z/(n 2 ao). 

To obtain (r -2 ) we anticipate a result that we shall prove in §9.1. This 
relates to what happens when we add a term fiH\ to a system’s Hamiltonian, 
where /? is a number and H\ is an operator. The n th eigenenergy of the 


3 W.E. Lamb & R.C. Retherford, Phys. Rev. 72, 241 
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complete Hamiltonian then becomes a function of /3, and in §9.1 we show 
that 

= (E\H 1 \E). (8.55) 

/ 3=0 


d E 
dp 


We apply this result to a hydrogen-like system with the additional Hamilto¬ 
nian 


ffi = 


n 2 

2 fir 2 


(8.56) 


In the last subsection we showed that the exact eigenvalues of this system 
are given by equation (8.51). Differentiating the eigenvalues with respect to 
ft and using equation (8.55) we find 


n , , o. . d 

—= — 


Z 2 e 2 


/ 3 =o 8ne 0 ao(l' + k ) 2 


Z 2 e 2 dl' 
4:Treoao(l + k) 3 dp 


(8.57) 


From equation (8.46) we have dl'/dp = —1/(2 1 + 1), so 


(E\r~ 2 \E) 


Z 2 e 2 /r 

2neoaoh 2 n 3 (21 + 1) 


Z 2 

a^n 3 (l + i) ’ 


(8.58) 


where the last equality uses the definition (8.15b) of the Bohr radius. 

We determine (c -3 ) by considering the expectation value of the com¬ 
mutator [H,p r ], As we saw in §2.2.1, in a stationary state the commutator 
with H of any observable vanishes. Hence with equation (8.12) we can write 


°= (E\[H,p r ]\E) 


l l±^(E\[r- 2 ,p r ]\E) ^-(E\[r-\ Pr ]\E) (8.59) 


Using the canonical commutation relation [ r lPr ] = i fi [equation (7.67)] to 
evaluate the commutators in this expression, and the value of (r~ 2 ) that we 
have just established, we find 


(E\r~ 3 \E) 


Z 3 

aln 3 l(l + !)(/ + i) 


(8.60) 


The three values of (r fc ) that we have calculated conform to a pattern. 
First the basic atomic scale clq/Z is raised to the — k th power. Then there is 
a product of 2k quantum numbers on the bottom, reflecting the tendency for 
the atom’s size to grow as n 2 . Finally, as k increases, the number of factors 
of l increases from zero to three, reflecting the growing sensitivity of (?' _3 ) 
to orbital eccentricity. 


8.2 Fine structure and beyond 

The model of hydrogen-like ions that we developed in the last section is 
satisfying and useful, but it is far from complete. We now consider some of 
the physics that is neglected by this model. 

In §2.3.5 we saw that when a particle that moves in an inverse-square 
force field is in a stationary state, the expectation value of its kinetic energy, 
classically ^mv 2 , is equal in magnitude but opposite in sign to its total 
energy. Equation (8.25) is an expression for the ground-state energy of an 
electron in a liydrogen-like ion. When we equate the absolute value of this 
expression to \m e v 2 , we find that the ratio of v to the speed of light c is 

- = aZ, 
c 


(8.61) 
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where the dimensionless fine structure constant is defined to be 


a = 


47reo he 


1 

137' 


(8.62) 


Since relativistic corrections tend to be O (v 2 /c 2 ), it follows that in hydrogen 
relativistic corrections to the results we have derived may be expected to 
be several parts in 10 5 , but these corrections, being proportional to Z 2 , will 
exceed 10% by the middle of the periodic table. 

For future reference we note that with the reduced mass /i approximated 
by m e , equation (8.15b) for the Bohr radius can be written 


h 

a 0 = - 

am e c 


^Compton 

27ra 


(8.63) 


where we have identified the electron’s Compton wavelength h/m e c (the 
wavelength of a photon that has energy m e c 2 ). When we use this expression 
to eliminate ao from equation (8.25), we find that the energy levels of a 
hydrogen-like ion are 

Z 2 a 2 

E = ——— ?n e c 2 , so 7 Z = ba 2 m e (?. (8.64) 

2 n z 


8.2.1 Spin-orbit coupling 

Magnetism is a relativistic correction to electrostatics in the sense that a 
particle that is moving with velocity v in an electric field E experiences a 
magnetic field 

B = x E. (8.65) 

c z 

If the particle has a magnetic dipole moment p, it experiences a torque 
G = p x B that will cause its spin S to precess. In the particle’s rest frame 
the classical equation of motion of S is 


= 7^ X B ’ ( 8 - 66 ) 

where h appears only because S is the dimensionless spin obtained by divid¬ 
ing the angular momentum by h. We assume that the magnetic moment p is 
proportional to the dimensionless spin vector S and write the proportionality 


p = 


2mo 


(8.67) 


where g is the dimensionless gyromagnetic ratio, and Q and ?7i 0 are the 
particle’s charge and rest mass. In the case of an electron g = 2.002, a value 
which is correctly predicted by relativistic quantum electrodynamics, and 
the dimensional factor is defined to be the Bohr magneton 


Pb = 


eh 
2 m e 


= 9.27 x 10" 24 JT” 1 . 


( 8 . 68 ) 


With this notation, our rest-frame equation of motion (8.66) becomes 


dS 
d t 


gQ 

2?7lo 


S x B. 


(8.69) 


The non-zero value of the right side of this classical equation of motion 
for S implies that there is a spin-dependent term in the particle’s Hamiltonian 
since the operator S commutes with all spatial operators (§7.4) and the 
right side of the classical equation of motion (8.69) is proportional to the 
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expectation of [S,_fT] (cf. eq. 2.57). We want to determine what this term in 
H is. 

Energy is not a relativistic invariant - it is physically obvious that ob¬ 
servers who move relative to one another assign different energies to a given 
system. Consequently, when they do quantum mechanics they use different 
Hamiltonians. We need the Hamiltonian that governs the dynamics of the 
reduced particle in the rest frame of the atom’s centre of mass. So we have 
to transform the equation of motion (8.69) to this frame. This is a tricky 
business because the reduced particle is accelerating, so the required Lorentz 
transformation is time-dependent. Given the delicacy of the required trans¬ 
formation, it is advisable to work throughout with explicitly Lorentz ‘covari- 
ant’ quantities, which are explained in Appendix G. In Appendix H these 
are used to show that in a frame of reference in which the electron is moving, 
equation (8.69) becomes 


dS 

d t 


Q ( 

2to 0 c 2 V 


h d$ 
mor dr 


S x L + 2c 2 S x B 


(8.70) 


It is straightforward to demonstrate (Problem 8.14) from equation (2.34) that 
this classical equation of motion of the spin S of an electron (which has charge 
Q = —e) arises if we introduce into the quantum-mechanical Hamiltonian 
(8.1) two spin-dependent terms, namely the spin-orbit Hamiltonian 


Hso 


d$ eh 2 
dr 2rTOgC 2 


S L, 


(8.71) 


and the Zeeman spin Hamiltonian 


H zs = — S-B. 

m e 


(8.72) 


The Zeeman spin Hamiltonian is just p • B with equation (8.67) used to 
replace the magnetic moment operator by the spin operator. Interestingly, 
the spin-orbit Hamiltonian is a factor two smaller than p • B with p replaced 
in the same way and equation (8.65) used to relate B to the electric field 
in which the electron is moving. In the 1920s the experimental data clearly 
required this factor of two difference in the spin Hamiltonians, but its origin 
puzzled the pioneers of the subject until, in 1927, L.T. Thomas 4 showed that 
it is a consequence of the fact that the electron’s rest frame is accelerating 
(Appendix H). If no torque is applied to a gyro, it does not precess in its 
instantaneous rest frame. But if the direction of the gyro’s motion is chang¬ 
ing relative to some inertial frame, the sequence of Lorentz transformations 
that are required to transform the spin vector into the inertial frame causes 
the spin to precess in the inertial frame. This apparent precession of an 
accelerated gyro is called Thomas precession. 

In a single-electron system such as hydrogen, $ = Ze/(4ne 0 r), so 


Hso 


Zati 3 
2 m 2 cr 3 


S L, 


(8.73) 


where the fine-structure constant (8.62) has been used to absorb the 47reo- 
Since the coefficient in front of the operator S • L is positive, spin-orbit 
coupling lowers the energy when the spin and orbital angular momenta are 
antiparallel. 

The operator S • L in equation (8.73) is most conveniently written 

S L = ±((L + S) 2 - L 2 - S 2 ) = ±(J 2 — L 2 — S 2 ), (8.74) 


4 Phil. Mag. 3, 1 (1927) 
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Figure 8.6 The fine structure of 
a hydrogen-like ion with Z = 200 
that is predicted by equation (8.76a). 
The dotted line denotes a break in 
the energy scale so that the ground 
state can be included. 


so Hgo is diagonal in a basis made up of mutual eigenkets of J 2 , L 2 and S 2 . 
In §7.5 we constructed such mutual eigenkets from the eigenkets of S 2 , S z , 
L 2 , and L z . S • L annihilates states with quantum number l = 0 because 
then j = s. Hence, there is no spin-orbit coupling in the ground state of 
hydrogen. In any excited state, l > 0 is permitted, and from §7.5.2 we know 
that the possible values of j are l ± The associated eigenvalues of the 
operator on the right of equation (8.74) are readily found to be 

Hi0 + l)- 1(i + l)-f} = {/ (i + 1) ?; (8.75) 


Although S • L commutes with the gross-structure Hamiltonian Hqs 
(eq. 8 . 1 ), the other operator in Hso , namely r -3 , does not. So the eigenkets 
of Hqs + Hgo w iH differ (subtly) from the eigenkets we have found. In 
§9.1 we shall show that in these circumstances the change in the energy of a 
stationary state can be estimated by replacing the operator by its expectation 
value. Equation (8.60) gives this value, and, inserting this with our results 
for the spin operators, yields energy shifts 


A E = (n, l, m\H S o\n, l, m) ~ K n j 


(.I + 1 )(l + |) for j = l + \ 
-1(1 + 5 ) for j = l — \ 


where 


K n 


Z 4 ah 3 
4ag m%cn 3 


Z 4 a 4 

4n 3 


m e c 2 , 


(l > 0 ), 

(8.76a) 

(8.76b) 


and the second equality uses equation (8.63) for ao- The difference between 
the energies of states with j = l ± \ is 


Ei + 1/2 — Ei _ 1/2 


2 K n 
1(1 + 1 )' 


(8.76c) 


For n = 1 the fine-structure energy scale K n is smaller than the gross- 
structure energy (8.64) by a factor Z 2 /2a 2 that rises from parts in 10 5 for 
hydrogen to more than 10% by the middle of the periodic table . 5 In hydrogen 
fine-structure is largest in the n = 2 , l = 1 level, which is split into j = | and 
j = \ sublevels. According to equations (8.76c), these sublevels are separated 
by K 2 = 4.53 x 10~ 5 eV, while the measured shift is 4.54 x 10~ 5 eV. 

Figure 8.6 shows the prediction of equation (8.76a) for the energy levels 
of a hydrogen-like ion with Z = 200. With this unrealistically large value 

5 Naturally, the fine-structure constant owes its name to its appearance in this ratio 
of the fine-structure and gross energies. 




8.2 Fine structure and beyond 


197 


of Z the fine structure for n = 2 has comparable magnitude to the gross- 
structure difference between the n = 2 and n = 3 levels. The levels in 
this figure are labelled in an obscure notation that is traditional in atomic 
physics and more fully explaine in Box 10.2. The value of n appears first, 
followed by one of the letters S, P, D , F to denote l = 0,1, 2, 3, respectively . 6 
The value of j appears as a subscript to the letter, and the value of 2s + 
1 (here always 2) appears before the letter as a superscript. So the level 
2 2 J D 3/2 has n = 2, s = 1/2, l = 1, and j = |. From Figure 8.6 we see 
that states in which j is less than l (because the electron’s spin and orbital 
angular momenta are antiparallel) are predicted to have lower energies than 
the corresponding states in which the two angular momenta are aligned. The 
spin-orbit interaction vanishes by symmetry for s = 0 but otherwise at fixed 
n the magnitude of the effect decreases with increasing angular momentum 
because the electron’s top speed on a nearly circular orbit is smaller than on 
an eccentric orbit, so relativistic effects are largest on eccentric orbits. 

Equation (8.76a) suggests that states that differ in l but not j should 
have different energies, whereas they do in fact have extremely similar en¬ 
ergies. For example, the 2 2 S 1 / 2 state lies 4.383 x 10 -6 eV above the 2 2 P 1 / 2 
state, while equation (8.76a) implies that this energy difference should be 
| A/> = 6.79 x 10 " 5 eV. This discrepancy arises because the spin-orbit Hamil¬ 
tonian does not provide a complete description to order a 4 of relativistic 
corrections to the electrostatic Hamiltonian. Actually, additional corrections 
shift the energy of the 2 2 S 1 / 2 states into close alignment with the energy of 
the 2 2 Pi/ 2 states . 7 However, in atoms with more than one electron, the 
electrostatic repulsion between the electrons shifts the energy of the 2 2 S ' 1 / 2 
states downwards by much larger amounts. These electrostatic corrections 
are hard to calculate accurately, so the much smaller relativistic corrections 
are not interesting, experimentally, and the quantities of interest are differ¬ 
ence in energy between states with the same values of l but different j. These 
difference are correctly given by equation (8.76a). 

Relativistic quantum electrodynamics is in perfect agreement with mea¬ 
surements of hydrogen. It uses the Dirac equation rather than classically- 
inspired corrections to the electrostatic Hamiltonian. We have devoted sig¬ 
nificant space to deriving the spin-orbit Hamiltonian not because it plays a 
role in hydrogen, but because it becomes important as one proceeds down 
the periodic table. The other relativistic corrections also become large by the 
middle of the periodic table, but outside hydrogen their effects are so masked 
by electron-electron interactions that they are of little practical importance 
and we shall not discuss them in this book. 


8.2.2 Hyperfine structure 

A proton is a charged spin-half particle, so like an electron it has a magnetic 
moment. By analogy with the definition of the Bohr magneton (eq. 8.68), 
we define the nuclear magneton to be 


hp — 


eh 

2m p 


= 5.05 x 10~ 27 JT” 1 . 


(8.78) 


6 These letters are a shorthand for a description of spectral lines that later were found 
to involve the various l values: sharp, principal, diffuse, faint. 

7 In the lowest order of relativistic quantum electrodynamics, the energy of a hydrogen 
atom depends on only n and j : the Dirac equation predicts 


E = - 


n 



a 2 Z 2 



(8.77) 


Thus the 2 2 5' 1 /2 states are predicted to have the same energy as the 2 2 P 1 /2 states. The 
measured Lamb shift between these states arises in the next order as a consequence of 
polarisation of the vacuum, as described in §8.1.3. 
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In terms of /i p , the magnetic-moment operator of the proton is 

H = gpf-ipSp, (8.79) 

where g p = 5.58 and S p is proton’s spin operator, so the proton’s magnetic 
moment is smaller than that of an electron by a factor 2.79 m e /m v ~ 1.5 x 
10 ~ 3 . 

The electron in a hydrogen atom can create a magnetic field at the 
location of the proton in two ways: as a moving charge, it generates a cur¬ 
rent, and it has its intrinsic magnetic moment, so its probability distribution 
|V>(x)| 2 is a distribution of magnetic dipoles that will generate a magnetic 
field just as iron does in a bar magnet. 

The ground level of hydrogen is a particularly simple case because in 
this state the electron has no orbital angular momentum, so it generates a 
magnetic held exclusively through its dipole moment. The magnetic vector 
potential distance r from a magnetic dipole p e is 


4f r 47r V r / 


(8.80) 


The magnetic held is B = V x A, so the hyperfine-structure Hamiltonian 

for the ground state is 


«,-, = IVB=^-V»{Vx(4)). (8.81) 


Until .T/hfs is included in the atom’s Hamiltonian, the atom’s lowest 
energy level is degenerate because the spins of the electron and the proton 
can be combined in a number of different ways. To proceed further we need 
to evaluate the matrix elements obtained by squeezing 77 hfs between states 
that form a basis for the ground-level states. The natural basis to use is 
made up of the states |j = 0 ) and |j = l,m) for m = — 1 , 0,1 that can be 
constructed by adding two spin-half systems (§7.5.1). In Appendix I we show 
that the resulting matrix elements are 


(Vb s|i7 H Fs|Vb s') = ^|t/>(0)| 2 (s|p p • Mels') 

= - r ^IV , (0)| 2 g P Mp2/iB(s|S p • S e |s')> 


(8.82) 


where we have replaced the magnetic moment operators by the appropri¬ 
ate multiples of the spin operators. From our discussion of the spin-orbit 
Hamiltonian (8.73), which is also proportional to the dot product of two 
angular-momentum operators, we know that the eigenstates of the total an¬ 
gular momentum operators are simultaneously eigenstates of S p • S e with 
eigenvalues \{j{j + 1 ) — § — §}, so in this basis the off-diagonal matrix 
elements vanish and the diagonal ones are 


{^,j,m\H HFS \ip,j,m) = — ^-\ip{0)\ 2 gpPpp B {j{j + 1) - §}• (8.83) 


In §9.1 we shall show that the diagonal matrix elements provide a good 
estimates of the amount by which Hhfs shifts the energies of the stationary 
states of the gross-structure Hamiltonian. 

The total angular-momentum quantum number of the atom can be j = 0 
or j = 1 and the two possible values of the curly bracket above differ by two. 
Equation (8.36) gives |?/>(0 )| 2 = l/( 7 rog), so the energies of these levels differ 
by 


A E 


4/tq 

37T(Iq 


5.58p p pB = 5.88 x 1(T 6 


eV. 


(8.84) 


The lower level, having j = 0, is non-degenerate, while the excited state is 
three-fold degenerate. Transitions between these levels give rise to radiation 
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of frequency 1.420 405 7518 GHz. To obtain perfect agreement between this 
most accurately measured frequency and equation (8.84), it is necessary to 
change the gyromagnetic ratio of the electron from the value of 2 that we 
have adopted to the value 2.002319 ... that is predicted by quantum electro¬ 
dynamics. The agreement between theory and experiment is then impressive. 

The hyperfine line of hydrogen provides the most powerful way of tracing 
diffuse gas in interstellar and intergalactic space. Radiation at this frequency 
can propagate with little absorption right through clouds of dust and gas 
that absorb optical radiation. Consequently it was in radiation at 1.4 GHz 
that the large-scale structure of our own galaxy was first revealed in the 
1950s. The line is intrinsically very narrow with the consequence that the 
temperature and radial velocity of the hydrogen that emits the radiation 
can be accurately measured from the Doppler shift and broadening in the 
observed spectral line. The existence of 1.4 GHz line radiation from our 
galaxy was predicted theoretically by H.C. van de Hulst as part of his doctoral 
work in Nazi-occupied Utrecht. In 1951 groups in the USA and Australia 
and the Netherlands, detected the line almost simultaneously. The Dutch 
group used a German radar antenna left over from the war. 

Problems 

8.1 Some things about hydrogen’s gross structure that it’s important to 
know (ignore spin throughout): 

a) What quantum numbers characterise stationary states of hydrogen? 

b) What combinations of values of these numbers are permitted? 

c) Give the formula for the energy of a stationary state in terms of the 
Rydberg 1Z. What is the value of TZ in eV? 

d) How many stationary states are there in the first excited level and in 
the second excited level? 

e) What is the wavefunction of the ground state? 

f) Write down an expression for the mass of the reduced particle. 

g) The wavefunction (x|n) of any state with principal quantum number n 
contains an exponential in r = |x|. Write down the scale length of this 
exponential in terms of the Bohr radius ao- 

h) We can apply hydrogenic formulae to any two charged particles that are 
electrostatically bound. How does the ground-state energy then scale 
with (i) the mass of the reduced particle, and (ii) the charge Ze on the 
nucleus? (iii) How does the radial scale of the system scale with Z1 

8.2 Show, by induction or otherwise, that there are n 2 stationary states of 
hydrogen with energy E = —7 Z/n 2 . 

8.3 In the Bohr atom, electrons move on classical circular orbits that have 
angular momenta lfi 1 where l = 1,2,.... Show that the radius of the first 
Bohr orbit is ao and that the model predicts the correct energy spectrum. 
In fact the ground state of hydrogen has zero angular momentum. Why did 
Bohr get correct answers from an incorrect hypothesis? 

8.4 Show that the speed of a classical electron in the lowest Bohr orbit 
(Problem 8.3) is v = ac, where a = e 2 / 47 reo?ic is the fine-structure constant. 
What is the corresponding speed for a hydrogen-like Fe ion (atomic number 
Z = 26)? Given these results, what fractional errors must we expect in the 
energies of states that we derive from non-relativistic quantum mechanics. 

8.5 Show that Bohr’s hypothesis (that a particle’s angular momentum must 
be an integer multiple of h), when applied to the three-dimensional harmonic 
oscillator, predicts energy levels E = ITiuj with l = 1,2,.... Is there an 
experiment that would falsify this prediction? 

8.6 Show that the electric Held experienced by an electron in the ground 
state of hydrogen is of order 5 x 10 11 V m _1 . Why is it impossible to generate 
comparable macroscopic fields using charged electrodes. Lasers are available 
that can generate beam fluxes as big as 10 22 Wm -2 . Show that the electric 
field in such a beam is of comparable magnitude. 



200 


Problems 


8.7 Positronium consists of an electron and a positron (both spin-half and 
of equal mass) in orbit around one another. What are its energy levels? By 
what factor is a positronium atom bigger than a hydrogen atom? 

8.8 The emission spectrum of the He + ion contains the Pickering series of 
spectral lines that is analogous to the Lyman, Balrner and Pasclien series in 
the spectrum of hydrogen. 

Balrner i = 1,2,... 0.456806 0.616682 0.690685 0.730884 
Pickering i = 2,4,... 0.456987 0.616933 0.690967 0.731183 

The table gives the frequencies (in 10 15 Hz) of the first four lines of the Balrner 
series and the first four even-numbered lines of the Pickering series. The 
frequencies of these lines in the Pickering series are almost coincident with 
the frequencies of lines of the Balrner series. Explain this finding. Provide a 
quantitative explanation of the small offset between these nearly coincident 
lines in terms of the reduced mass of the electron in the two systems. (In 1896 
E.C. Pickering identified the odd-numbered lines in his series in the spectrum 
of the star ( Puppis. Helium had yet to be discovered and he believed that 
the lines were being produced by hydrogen. Naturally he confused the even- 
numbered lines of his series with ordinary Balrner lines.) 

8.9 Tritium, 3 H, is highly radioactive and decays with a half-life of 12.3 
years to 3 He by the emission of an electron from its nucleus. The electron 
departs with 16 keV of kinetic energy. Explain why its departure can be 
treated as sudden in the sense that the electron of the original tritium atom 
barely moves while the ejected electron leaves. 

Calculate the probability that the newly-fornred 3 He atom is in an ex¬ 
cited state. Hint: evaluate (1,0,0 ;Z = 2| 1,0, 0 ;Z = 1). 

8 .10* A spherical potential well is defined by 


V(r) 


0 for r < a 
Vq otherwise, 


(8.85) 


where Vq > 0. Consider a stationary state with angular-momentum quantum 
number l. By writing the wavefunction tH x ) = i?(r)Y[™(0, 4>) and using 
p 2 = p 2 + h 2 L 2 /r 2 , show that the state’s radial wavefunction R(r) must 
satisfy 


/ d_ 
2 m \dr 



1(1 + l)h 2 

2 inr 2 


R + V(r)R = ER. 


( 8 . 86 ) 


Show that in terms of S(r) = rR(r), this can be reduced to 


^ - HI + 1)^ + ^(E-V)S = 0. (8.87) 

Assume that Vo > E > 0. For the case l = 0 write down solutions to this 
equation valid at (a) r < a and (b) r > a. Ensure that R does not diverge 
at the origin. What conditions must S satisfy at r = a? Show that these 
conditions can be simultaneously satisfied if and only if a solution can be 
found to k cot ka = — K , where h 2 k 2 = 2 mE and h 2 K 2 = 2 m(Vo — E). 
Show graphically that the equation can only be solved when y/2mVo a/Ti > 
7 t/ 2 . Compare this result with that obtained for the corresponding one¬ 
dimensional potential well. 

The deuteron is a bound state of a proton and a neutron with zero 
angular momentum. Assume that the strong force that binds them produces 
a sharp potential step of height Vo at interparticle distance a = 2 x 10 -15 m. 
Determine in MeV the minimum value of Vq for the deuteron to exist. Hint: 
remember to consider the dynamics of the reduced particle. 
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8.11 Let the wavefunction of the stationary states of the gross-structure 
Hamiltonian of hydrogen be (x| n, l, m) = u l n {r)Y] n {9, </>). Show that 

drr 2 u l n (r)u l n ,(r) = S nn >. ( 8 . 88 ) 

By considering an appropriate Sturm-Liouville equation, or otherwise, show 
further that 

d r u l n (r)u l n (r) = Ci6 w . (8.89) 




8.12 Show that for hydrogen the matrix element (2,0,0|z|2,1, 0) = —3ao- 
On account of the non-zero value of this matrix element, when an electric 
field is applied to a hydrogen atom in its first excited state, the atom’s energy 
is linear in the field strength (§9.1.2). 

8.13* From equation (8.46) show that l' + | (l + i ) 2 - /3 and that the 

increment A in l' when l is increased by one satisfies A 2 + A(2Z'+1) = 2(Z+1). 
By considering the amount by which the solution of this equation changes 
when V changes from l as a result of /? increasing from zero to a small number, 
show that 

A = 1 + i ^ T + 0 ( /3 2) - (8-90) 

Explain the physical significance of this result. 

8.14 Show that Ehrenfest’s theorem yields equation (8.70) with B = 0 
as the classical equation of motion of the vector S that is implied by the 
spin-orbit Hamiltonian (8.71). 

8.15* (a) A particle of mass m moves in a spherical potential V(r). Show 

that according to classical mechanics 


cl 2 dR de r 

— p x L c ) = mr — -— 

di VH ' dr d t 


(8.91) 


where L c = r x p is the classical angular-momentum vector and e r is the 
unit vector in the radial direction. Hence show that when V(r) = — K/r , 
with K a constant, the Runge-Lenz vector M c = p x L c - mKe r is a 
constant of motion. Deduce that M c lies in the orbital plane, and that for 
an elliptical orbit it points from the centre of attraction to the pericentre of 
the orbit, while it vanishes for a circular orbit. 

(b) Show that in quantum mechanics (p x L)f - pxL = —2ip. Hence 
explain why in quantum mechanics we take the Runge-Lenz vector operator 
to be 

M = i SN — m.Ke r where N = pxL Lxp. (8.92) 

Explain why we can write down the commutation relation [Li, Mj\ = i CijkMk- 

(c) Explain why [p 2 , A] = 0 and why [1/r, pxL] = [1/r, p] x L. Hence 
show that 


[1/r, N] = i|-^(r 2 p-xx-p) - (pr 2 - p • xx) -^-j. (8.93) 

(d) Show that 

\p 2 ,e r ] (V + Jp) +IZ(ft^ x + x ;W)}- ( 8J4 ) 

(e) Hence show that [H, M] = 0. What is the physical significance of 
this result? 
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(f) Show that (i) [M, ; ,L 2 ] = i £\ fc eij k (M k Lj + LjM k ), (ii) [Li, M 2 ] = 0, 
where M 2 = M 2 + M 2 + M 2 . What are the physical implications of these 
results? 

(g) Show that 

[N z , Nj] = — 4i ^ e iju p 2 L u (8.95) 

U 

and that 

[N h (e r )j] - [Nj, (e r )j] = ^ (8.96) 

t 

and hence that 

Afj] = -2i Ti 2 mH ^ e ijk L k . (8.97) 

k 

What physical implication does this equation have? 
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It is rarely possible to solve exactly for the dynamics of a system of experi¬ 
mental interest. In these circumstances we use some kind of approximation 
to tweak the solution to some model system that is as close as possible to 
the system of interest and yet is simple enough to have analytically solvable 
dynamics. That is, we treat the difference between the experimental system 
and the model system as a ‘perturbation’ of the model. Perturbation theory 
in this sense was an important part of mathematical physics before quantum 
mechanics appeared on the scene - in fact the development of Hamiltonian 
mechanics was driven by people who were using perturbation theory to un¬ 
derstand the dynamics of the solar system. Interestingly, while perturbation 
theory in classical mechanics remains an eclectic branch of knowledge that is 
understood only by a select few, perturbation theory in quantum mechanics 
is a part of main-stream undergraduate syllabuses. There are two reasons 
for this. First, analytically soluble models are even rarer in quantum than in 
classical physics, so more systems have to be modelled approximately. Sec¬ 
ond, in quantum mechanics perturbation theory is a good deal simpler and 
works rather better than in classical mechanics. 


9.1 Time-independent perturbations 

Let H be the Hamiltonian of the experimental system and Ho the Hamil¬ 
tonian of the model system for which we have already solved the eigenvalue 
problem. We hope that A = H — Ho is small and define 

Hp = H 0 + pA. (9.1) 

We can think of Hp as the Hamiltonian of an apparatus that has a knob on 
it labelled ‘/3’; when the knob is turned to f3 = 0, the apparatus is the model 
system, and as the knob is turned round to /3 = 1, the apparatus is gradually 
deformed into the system of experimental interest. 

We seek the eigenkets | E) and eigenvalues E of Hp as functions of /3. 
Since the Hamiltonian of the apparatus is a continuous function of /3, we 
conjecture that the \E) and E are continuous functions of /3 too. In fact, we 
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conjecture that they are analytic functions 1 of /3 so they can be expanded as 
power series 

\E) = |a) + (3\b) + /3 2 |c) + • • • ; E = E a + /3E b + j3 2 E c + • • •, (9-2) 

where |a), 16), etc., are states to be determined and E a , E b , etc., are appropri¬ 
ate numbers. When we plug our conjectured forms (9.2) into the eigenvalue 
equation H\ijj) = E\tp), we have 

{Hq + /3A)(|a) + (3\b) +/3 2 |c)+) = (E a + /3E b + f3 2 E c + ) (\a)+(3\b)+/3 2 \c) + ). 

(9.3) 

Since we require the equality to hold for any value of /3, we can equate the 
coefficient of every power of j3 on either side of the equation. 

H 0 \a) = E a \a) 

H 0 \b) + A\a) = E a \b) + E b \a) (9.4) 

Hq\c) + A|5) = E a \c) + E b \b) + E c \a). 

The first equation simply states that E a and |a) are an eigenvalue and eigen- 
ket of Hq. Physically, |a) is the state that we will find the system in if we 
slowly turn the knob back to zero after making a measurement of the energy. 
Henceforth we shall relabel E a with E$ and relabel |a) with \Eq), the zero 
reminding us of the association with /3 = 0 rather than implying that |i?o) 
is the ground state of the unperturbed system. 

To determine E b we multiply the second equation through by {Eq\: 

( E 0 \H 0 \b) + {E 0 \A\E 0 ) = E 0 (E 0 \b) + E b . (9.5) 

Now from Table 2.1, (E 0 \H 0 \b) = (( b\H 0 \E 0 ))* = Eo(Eo\b). Cancelling this 
with the identical term on the right, we are left with 

E b = (E 0 \A\E 0 ). (9.6) 

Thus the first-order change in the energy is just the expectation value of 
the change in the Hamiltonian when the system is in its unperturbed state, 
which makes good sense intuitively. This is the result that we anticipated in 
§§8.2.1 and 8.2.2 to estimate the effects on the allowed energies of hydrogen 
of the spin-orbit and hyperhne Hamiltonians. 

To extract the second-order change in E we multiply the third of equa¬ 
tions (9.4) by ( E 0 \ . Cancelling (Eo\H 0 \c) on E 0 (E 0 \c) by strict analogy with 
what we just did, we obtain 

E c = {E 0 \A\b)-E b {E 0 \b). (9.7) 

To proceed further we have to determine | b), the first-order change in the 
state vector. Since the eigenkets \E n ) of H 0 form a complete set of states, 
we can write |6) as the sum 


/ 3 ° 

/ 3 1 

0 1 


\b)=J2 b k\E k ). (9.8) 

k 

In the second of equations (9.4) we replace \b) by this expansion and multiply 
through by ( Em\ ^ (^o| to find 

, _ (Em\A\E 0 ) 

0rn — J7 J7 

-&0 — frm 


1 Much interesting physics is associated with phenomena in which a small change in 
one variable can produce a large change in another (phase changes, narrow resonances, 
caustics, ...). In classical physics perturbation theory is bedevilled by such phenomena. 
In quantum mechanics this conjecture is more successful, but still untrustworthy as we 
shall see in §9.1.2. 
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Box 9.1: Ensuring that (a\b) = 0 


Since the perturbed eigenket should be properly normalised, we have 
1 = (E\E) = «.Eb| + m + • • •) (\E 0 ) + p\b) + • • •) 

= l + P((E 0 \b) + (b\E 0 )) + OOd 2 ). 


Equating the coefficient of /? on each side of the equation we conclude 
that (Eo\b) + (b\Eo) = 0, from which it follows that (Eo\b) is pure imagi¬ 
nary. The phase of | E) is arbitrary, and we are free to choose this phase 
independently for each model Hamiltonian Hp. In particular, instead of 
using | E) we can use | E') = e 1Q/3 | E), where a is any real constant: | E') 
is our original eigenket but with its phase shifted by a linear function of 
/ 3. When we expand | E') in powers of /3 we have 

\E') = \E 0 ) + (3\b') + ■ ■ ■, 


where | b') is the derivative of | E') with respect to f3 evaluated at j3 = 0. 
This is 


I b') 


d| E') 


d/3 


/3=0 


d 

d/3 


(e iQ/3 | E)) 


/3=0 


ia| E 0 ) + | b). 


Consequently, 

{E 0 \b') = ia + (E 0 \b). 

Since (Eo\b) is known to be pure imaginary, it is clear that we can choose 
a such that (Eo\b') = 0. This analysis shows that the phases of the 
perturbed eigenkets can be chosen such that the first order perturbation 
16) is orthogonal to the unperturbed state \E 0 ) and one generally assumes 
that this choice has been made. 


This expression determines the coefficient of all kets in (9.8) that have en¬ 
ergies that differ from the unperturbed value Eq. For the moment we as¬ 
sume that Eq is a non-degenerate eigenvalue of Hq, so there is only one 
undetermined coefficient, namely that of |Eo). Fortunately we can argue 
that this coefficient can be taken to be zero from the requirement that 
| E) = \Eq) + f3\b) + 0(/3 2 ) remains correctly normalised. The complete ar¬ 
gument is given in Box 9.1 but we can draw a useful analogy with changing 
a three-dimensional vector so that the condition |r| = 1 is preserved; clearly 
we have to move r on the unit sphere and the first-order change in r is nec¬ 
essarily perpendicular to the original value of r. The quantum-mechanical 
normalisation condition implies that as /3 increases | E) moves on a hyper¬ 
sphere in state space and (E 0 \b) = 0. So we exclude |Eo) from the sum in 
(9.8) and have that the first-order change to the stationary state is 


w = E 

ra /0 


(E m \A\E 0 ) 
Eq — E m 


| E m ). 


(9.10) 


When this expression for | b) is inserted into equation (9.7), we have that the 
second-order change in E is 


e c = j2 

k=jt 0 


(E 0 \A\E k )(E k \A\E 0 ) 
Eq — Ek 


(9.11) 


9.1.1 Quadratic Stark effect 

Let’s apply the theory we’ve developed so far to a hydrogen atom that has 
been placed in an electric field E = — Vd>. An externally imposed electric 
field is small compared to that inside an atom for field strengths up to £ ~ 
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5 x 10 11 Vm _1 (Problem 8.6) so perturbation theory should yield a good 
estimate of the shifts in energy level that ordinary fields effect. By the 
definition of the electrostatic potential <F, the field changes the energy of the 
atom by 

SE = e{$(x p ) - <F(x e )}, (9.12) 

where x p and x e are the position vectors of the proton and electron, respec¬ 
tively. We assume that the field changes very little on the scale of the atom, 
and, as in §8.1, we define r = x e — x p . Then we may write 


SE ~ -er • V$ = er • E. (9.13) 

We orient our coordinate system so that E is parallel to the z axis and use 
the notation £ = |E|. Then it is clear that the effect of imposing an external 
electric field is to add to the unperturbed Hamiltonian a term 


A = e£z. (9.14) 

Suppose the atom is in its ground state 1100), where the digits indicate the 
values of n, l and m. Then from equation (9.6) the first-order energy change 
in E is 

E b = e£(100|z|100). (9.15) 

In §4.1.4 we saw that the expectation value of any component of x vanishes 
in a state of well-defined parity. Since the ground-state ket 1100) has well 
defined (even) parity, E b = 0, and the change in E is dominated by the 
second-order term E c . For our perturbation to the ground state of hydrogen, 
equation (9.11) becomes 


E , = « 2 £ 2 £ £ 

n—2 i<n 
|m| <1 


(100|^|nZm)(nZm|^|100) 
Ei — E n 


(9.16) 


Symmetry considerations make it possible to simplify this sum dramatically. 
First, since [L z ,z] = 0 (Table 7.3), z\nlm) is an eigenfunction of L z with 
eigenvalue m, and therefore orthogonal to 1100) unless m = 0. Therefore 
in equation (9.16) only the terms with m = 0 contribute. Second, we can 
delete from the sum over l all even values of l because, as we saw in §4.1.4, 
the matrix elements of an odd-parity operator between states of the same 
parity vanish. In fact, a result proved in Problem 7.21 shows that the terms 
with l = 1 are the only non-vanishing terms in the sums over l in (9.16). 
Thus 


E c = e 2 £ 2 J2- 

n=2 


(100|z|nl0)(nl0|z|100) 
Ei — E n 


(9.17) 


It is easy to understand physically why the change in E is proportional 
to £ 2 . In response to the external electric field, the probability density of the 
atom’s charge changes by an amount that is proportional to the coefficients 
bk, and these coefficients are proportional to £. That is, the field polarises 
the atom, generating a dipole moment P that is oc £. The dipole’s energy is 
PE, so the energy change caused by the field is proportional to £ 2 . 


9.1.2 Linear Stark effect and degenerate perturbation theory 

Consider now the shift in the energy of the n = 2, l = 0 state of Hydrogen 
when an electric field is applied. The sum over k in (9.16) now includes the 
term 

(200|2|210)(210|z|200) 

E20 — E21 

which is infinite if we neglect the very small Lamb shift (§8.1.3), because 
the top is non-zero (Problem 8.12) and the difference of energies on the 
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bottom vanishes. It hardly seems likely that a negligible field will produce 
an arbitrarily large change in the energy of the first excited state of hydrogen. 
So what did we do wrong? 

Our error was to assume at the outset that a small stimulus produces a 
small response as we did when we wrote equations (9.2). Our infinite con¬ 
tribution to E c can be traced to our expression (9.9) for b m , which diverges 
as E m —> Eq. That is, the change in the wavefunction that a given field pro¬ 
duces is inversely proportional to the energy difference between the original 
state |E 0 ) and the state \E m ) we are pushing the system towards. This is an 
entirely reasonable result, analogous to what happens as we push a marble 
that lies at the bottom of a bowl: the distance the marble moves before 
coming into equilibrium depends on the curvature of the bowl. In the limit 
that the curvature goes to zero, and the bottom of the bowl becomes flat, an 
infinitesimal force will move the marble arbitrarily far, because all locations 
have the same energy. So we conclude that when the system’s initial energy 
is a degenerate eigenvalue Ed of Hq, a tiny stimulus is liable to produce a big 
change in the state (but not the energy) of the system. Disaster will attend 
an attempt to calculate this abrupt change of state by the approach we have 
been developing. 

So must we just give up in despair? No, because we can see that the 
only states that are going to acquire a non-negligible amplitude during the 
abrupt change are ones that have the same energy as -Ed- That is, the state 
to which the system abruptly moves can be expressed as a linear combination 
of the kets belonging to Ed- In many cases of interest there are only a small 
number of these (four in the problem of hydrogen on which we are working). 
What we have to do is to diagonalise the matrix A. tJ formed by A squeezed 
between all pairs of these kets. The eigenkets of A in this small subspace 
will be states of well-defined energy in the slightly perturbed system. As /3 
is ramped up from zero to unity their energies will diverge from Ed- We 
conjecture that in the instant that j3 departs from zero, the system’s state 
jumps to the eigenket with the lowest energy, and subsequently stays in this 
state as /3 increases. If this conjecture is correct, we should be able to use the 
perturbation theory we have developed provided we use as basis kets ones 
that diagonalise A as well as Hq. 

So let’s diagonalise e£z in the 4-dinrensional subspace of Hydrogen kets 
with n = 2. When we list the kets in the order 1200), |210), |211), |21—1), 
the matrix of A looks like this 


/ 0 a 0 0 \ 

a* 0 0 0 I 

0 0 0 0 

VO 000/ 


where a = ( 200 |z| 210 ). 


(9.18) 


From Problem 8.12 we have that (200|z|210) = —3ao- It is now easy to show 
that the eigenvalues of A are ±3e£ao and 0, while appropriate eigenkets are 
2 -1 / 2 (l, =f 1, 0, 0), (0,0,1,0) and (0, 0, 0,1). We conclude that as soon as the 
slightest perturbation is switched on, the system is in the state of lowest 
energy, \ip) = 2 -1 / 2 (|200) + |210)), and we use this state to determine Eft. 
We find 

Eft = \e£ ((200| + (210|)z(|200) + |210» 

o c 
— — OCLqCLs . 

From our discussion of the quadratic Stark effect, we know that a change 
in E that is proportional to £ requires the dipole moment P of an atom to be 
independent of £. Since Eft is proportional to £ we conclude that a hydrogen 
atom in the n = 2 state has a permanent electric dipole. 

In classical physics this result is to be expected because the orbit of the 
electron would in general be elliptical, and the tinre-averaged charge density 
along the ellipse would be higher at the apocentre than at the pericentre, 2 be¬ 
cause the electron lingers at the apocentre and rushes through the pericentre. 

2 An orbit’s apocentre is the point furthest from the attracting body, while the 
pericentre is the point nearest that body. 
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Figure 9.1 The charge distribu¬ 
tion of the state (1200) + 1210)) /\/2 
is axisymmetric. Here we plot the 
distribution in the ( R , z) plane of 
cylindrical polar coordinates. 


Hence the centre of charge would lie on the opposite side of the geometrical 
centre of the ellipse from the focus, where the proton’s cancelling charge lies. 
Thus, if the electron’s orbit were a perfect Kepler ellipse, the atom would 
have a permanent electric dipole moment parallel to the orbit’s major axis. 
Any deviation of the radial force field from F oc r -2 will cause the major 
axis of the ellipse to precess, and therefore the tinre-averaged polarisation of 
the atom to be zero. In hydrogen the force-field deviates verify little from 
an inverse-square law, so the precession occurs very slowly in the classical 
picture. Consequently, even a weak external field can prevent precession and 
thus give rise to a steady electric dipole. 

In the quantum-mechanical picture, shielding shifts the energy of the S 
state below that of the P states, thus ensuring that, in the absence of an 
imposed field, the atom is spherical and has no dipole moment. An electric 
field deprives L 2 of its status as a constant of motion because the field can 
apply a torque to the atom. Shielding is a very weak effect in hydrogen 
(because it relies on the vacuum’s virtual electrons and positrons), so the 
S state lies very little below the P states and in even a weak electric field 
this offset becomes irrelevant. The lowest-energy state becomes (1200) + 
|210))/-y/2. This is not an eigenket of L 2 but it is an eigenket of L z with 
eigenvalue zero. Thus its angular momentum is perpendicular to the field, 
as we expect from the classical picture of a Kepler ellipse with its major axis 
parallel to E. Figure 9.1 shows that in this state the charge distribution 
comprises a dense cloud around the origin and an extended cloud centred on 
R = 0, x ~ — 3ao- We can think of these clouds as arising from pericentre 
and apocentre, respectively, of eccentric orbits that have their major axes 
roughly aligned with the negative z axis. The integral / d 3 x, z\ip\ 2 = — 3ao, 
so in this state the atom has dipole moment P = +3eao- 


9.1.3 Effect of an external magnetic field 

When an atom is placed in a magnetic field, the wavelengths of lines in 
its spectrum change slightly. Much of quantum mechanics emerged from 
attempts to understand this phenomenon. We now use perturbation theory 
to explain it. 

In §3.3 we discussed the motion of a free particle in a uniform magnetic 
field. Our starting point was the Hamiltonian (3.31), which governs the mo¬ 
tion of a free particle of mass m and charge Q in the magnetic field produced 
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by the vector potential A. This is the Hamiltonian of a free particle, p 2 /2m, 
with p replaced by p — Q A. Hence we can incorporate the effects of a mag¬ 
netic field on a hydrogen atom by replacing p n and p e in the gross-structure 
Hamiltonian (8.1) with p p — eA and p e + eA, respectively. With Z = 1 the 
kinetic energy term in the Hamiltonian then becomes 


_ (p p - eA) 2 (p e + eA) 2 

ri I<E = 


2 ?n D 


2 m e 


2m r 


2 m e 


+ \ — — — ' A 


Pp 


-A-l * - PH 


We neglect the terms that are 0(A 2 ) on the grounds that when the 
weak enough for the 0(A) terms to be small compared to the terms 
gross-structure Hamiltonian, the 0(A 2 ) terms are negligible. 


0(A 2 ) 

(9.20) 
field is 
in the 


Equation (8.4) and the corresponding equation for (9/<9x p imply that 


Tfl 

Pe =-—-Px + Pr and p p 

m e + m p 


m p 

m e + m p 


PX Pr, 


(9.21) 


where px is the momentum associated with the centre of mass coordinate 
X, while p r is the momentum of the reduced particle. From the algebra that 
leads to equation ( 8 . 6 a) we know that the first two terms on the right of 
the second line of equation (9.20) reduce to the kinetic energy of the centre- 
of-mass motion and of the reduced particle. Using equations (9.21) in the 
remaining terms on the right of equation (9.20) yields 


Hke — 


Px 


2 (m e + to p ) 


EL 

2p 


2/i 


(p r ■ A + A-p r ), 


(9.22) 


where p is the mass of the reduced particle (eq. 8 . 6 b). It follows that an 
external magnetic field adds to the gross-structure Hamiltonian of a hydrogen 
atom a perturbing Hamiltonian 

H B = ^-(Pr-A + A-Pr). (9.23) 

On the scale of the atom the field is likely to be effectively homogeneous, so 
we may take A = -^B x r (page 49). Then Hb becomes 

Hb — ~~ —(p ■ B x rtB x r ■ p), (9.24) 

4?n e 

where we have approximated p by in e and dropped the subscript on p. 
The two terms in the bracket on the right can both be transformed into 
B r x p = KB ■ L because (i) these scalar triple products involve only 
products of different components of the three vectors, and (ii) [x-ippf = 0 
for i j. Hence, we do not need to worry about the order of the r and 
p operators and can exploit the usual invariance of a scalar triple product 
under cyclic interchange of its vectors. 

If an atom has more than one unpaired (‘valence’) electron, each electron 
will contribute a term of this form to the overall Hamiltonian. We can fold 
these separate contributions into a single contribution Hb by interpreting L 
as the sum of the angular-momentum operators of the individual electrons. 

In §8.2.1 we discussed terms that must be added to hydrogen’s gross- 
structure Hamiltonian to account for the effects of the electron’s intrinsic 
dipole moment. We found that the coupling with an external field is gener¬ 
ated by the Zeeman spin Hamiltonian (8.72). Adding this to the value of Hb 
that we have just computed, and orienting our coordinate system so that the 
2 axis is parallel to B, we arrive at our final result, namely that a uniform 
magnetic field introduces a perturbation 

pft 

BBs — B(L z + 2 S z ) = pbB(J z + S z ), 


(9.25) 
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Zeeman P—B 



Figure 9.2 Eigenvalues of the spin-dependent Hamiltonian A L ■ S + B(L Z + 2 S z ) as 
functions of B/A for the case Z = 1, s = 5 . The right side of the diagram (field strong 
compared to spin-orbit coupling) quantifies quantifies the Paschen—Back effect, while the 
left side of the diagram quantifies the Zeeman effect (weak field). The top and bottom 
lines on the extreme right show the energies of the states | 1 , 1 )|+) and 11 , — 1 )|—), which 
are eigenstates of the full Hamiltonian for all values of B/A. 


where S is the sum of the spin operators of all the valence electrons. 

The Hamiltonian formed by adding Hb s to the gross-structure Hamil¬ 
tonian (8.1) commutes with L 2 , L z , S 2 and S z . Its eigenkets are simply the 
eigenkets of the gross-structure Hamiltonian upgraded to include eigenvalues 
of S 2 and S z . The only difference from the situation we studied in §8.1 is 
that the energies of these eigenkets now depend on both L z and S z . Hence, 
each energy level of the gross-structure Hamiltonian is split by the magnetic 
field into as many sub-levels as mi + 2 m s can take. For example, if 1 = 0 
and .s = 2 i there are two sublevels, while when l = 1 and s = ^ there are 
five levels in which mi + 2 m s ranges between ± 2 . 

In practice the perturbation Hb s always acts in conjunction with the 
spin-orbit perturbation if go of equation (8.73 ). 3 The general case in which 
Hb s and i?so are comparable, requires numerical solution. The extreme 
cases in which one operator is larger than the other can be handled analyti¬ 
cally. 

Paschen—Back effect In a sufficiently strong magnetic field, ifso affects 
the atom much less than Hb s , so Hso simply perturbs the eigenkets of the 
Hamiltonian formed adding Hb s to the gross-structure Hamiltonian. The 
change in the energy of the state | n, l , mp s , m s ) is 

E b = (n,l,mi,s,m a \H so \n,l,mi,s,m. s ) ^ ^ 

= C(n, l,mi,s,m s \L • S| n,l,mi,s,m a ), 

where £ is a number with dimensions of energy that is independent of mi 
and m a . By writing L- S = ^(L + S_ + L_S + ) + L,S Z (eq. 7.143) we see that 
(L • S) = m;m s . So in a strong magnetic field the eigenenergies are 

Egross + Mb B(mi + 2 m s ) + C mim s . (9.27) 

The levels on the extreme right of Figure 9.2 show the energies described by 
this formula in the case that l = 1 and s = i. The fact that in a strong 
magnetic field an atom’s energies depend on mi and m s in this way is known 

as the Paschen—Back effect. 

Zeeman effect In a sufficiently weak magnetic field, Hgo affects the atom 
more strongly than Hb s - Then spin-orbit coupling assigns different energies 

3 There is no spin-orbit coupling for an S state, but an allowed spectral line from an S 
state will connect to a P state for which there is spin-orbit coupling. Hence the frequencies 
of allowed transitions inevitably involve spin-orbit coupling. 
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to states that differ in j. Consequently, when we use perturbation theory 
to calculate the smaller effect of an imposed magnetic field, the degenerate 
eigenspace in which we have to work is that spanned by the states that have 
given values of j, / and s but differ in their eigenvalues in of J z . Fortunately, 
Hbs is already diagonal within this space because [J Z ,S Z ] = 0. So the shift 
in the energy of each state is simply 

E b = (j,m,l,s\H Bs \j,m,l,s) = p B (m + (j,m,l,s\S z \j,m,l,s)). (9.28) 

As we saw in §7.5, our basis states do not have well-defined values of S z 
- in general they are linear combinations of eigenstates of L z and S z : 

S 

| j,m,l,s)= ^2 c m '\l, m — m')\s, m'), (9.29) 

m'=—s 

where the coefficients c m > are Clebsch-Gordan coefficients (eq. 7.152). In any 
concrete case it is straightforward to calculate the required expectation value 
of S z from this expansion. However, a different approach yields a general 
formula that was important historically. 

In the classical picture, spin-orbit coupling causes the vector S to precess 
around the invariant vector J. Hence, in this picture the expectation value of 
S is equal to the projection of S onto J. 4 The classical vector triple product 
formula enables us to express S in terms of this projection: 

J x (S x J) = J 2 S — (S • J)J so J 2 S = (S • J)J + J x (S x L). (9.30) 

In the classical picture, the expectation value of the vector triple product 
on the right side vanishes. If its quantum expectation value were to vanish, 
the expectation value of the z component of the equation would relate (S z ), 
which we require, to the expectation values of operators that have the states 
| j, m, l, s) as eigenstates, so our problem would be solved. Motivated by these 
classical considerations, let’s investigate the operator 

G = J X (S X L) SO Gi = ^ ( CijkJj^klm^lLm- (9.31) 

jklm 


It is straightforward to check that its components commute with the angular- 
momentum operators Ji in the way we expect the components of a vector to 
do: _ 

[J i ,G j ]=i'52e ijk G k . (9.32) 

k 

From equation (9.31) it is also evident that J • G = 0. In Problem 7.21 
identical conditions on the operators L and x suffice to prove that (x) = 0 in 
any state that is an eigenket of L 2 . So the steps of that proof can be retraced 
with L replaced by J and x replaced by G to show that for the states of 
interest (G) = 0. 

Now that we have established that the quantum-mechanical expectation 
value of G does indeed vanish, we reinterpret equation (9.30) as an operator 
equation, and, from the expectation value of its 2 component, deduce 

(j,m,l,s\S z \j,m,l,s) = (9.33) 

J{J + 1) 

From equation (8.74) we have 

J-S = L- S + S ' 2 = \{J 2 — L 2 + S 2 ), (9.34) 


4 This heuristic argument is often referred to as the vector model. 
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so we find 

E b = to^lMb-B where gL = ^1 + ^ + 1 ^ + 5 ^ + 1 ^ . (9.35) 

The factor g l is called the Lande g factor. In the early days of quantum 
theory, when the Bohr atom was taken seriously, people expected the mag¬ 
netic moment of an electron to be ±/in and therefore thought a magnetic 
field would shift energy levels by ±/zb B. Equation (9.35) states that the 
actual shift is mg^ times this. When this factor differed from unity, they 
spoke of an anomalous Zeeman effect. 

The left hand side of Figure 9.2 shows the energy levels described by 
equation (9.35) in the case l = 1, s = The possible values of j are | and 
and the magnetic field splits each of these spin-orbit levels into 2 j + 1 
components. 


9.2 Variational principle 

We now describe a method of estimating energy levels, especially a system’s 
ground-state energy, that does not involve breaking the Hamiltonian down 
into a part that has known eigenkets and an additional perturbation. In 
Chapter 10 we shall show that this method yields quite an accurate value 
for the ionisation energy of helium. 

Let H be the Hamiltonian for which we require the eigenvalues E n and 
the associated eigenkets | n). We imagine expanding an arbitrary state \tp) = 
Y^ n a n\ n ) as a linear combination of these eigenkets, and then calculate the 
expectation value of H in this state as 

(H) = (ip\H\t/j) = , (9.36) 

Ej kr 


where we have included the sum of the k'| 2 on the bottom to cover the 
possibility that Ik is not properly normalised. ( H ) is manifestly independent 
of the phase of a,. We investigate the stationary points of (H) with respect 
to the moduli |cq| by differentiating equation (9.36) with respect to them: 

9{H) 2\a k \E k _ 2|afc| Ej \ a i\ 2 Ej 

« ^'N 2 (E,klf ' 

Equating this derivative to zero, we find that the conditions for a stationary 
point of ( H) are 


0 = \a k \ E k 


E 


ad 


-Ei 


l E % 


(A = 0,1,...) 


(9.38) 


These equations are trivially solved by setting a k = 0 for every fc, but then 
\t/j) = 0 so the solution is of no interest. For any value of k for which a k ^ 0, 
we must have 


E k 


E ; \ai\ 2 E t 
EJk 2 ' 


(9.39) 


Since the right side of this equation does not depend on k , the equation can 
be satisfied for at most one value of k , and it clearly is satisfied if we set cq = 0 
for i ^ k and a k = 1, so | -0) = |fc). This completes the proof of Rayleigh’s 
theorem: The stationary points of the expectation value of an Hermitian 
operator occur at the eigenstates of that operator. Moreover, all eigenstates 
provide sta tionary points of the operator. That is, for general | ij>) the number 
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of {i/j\H\ip) isn’t equal to an eigenvalue of H , but if is stationary 

with respect to | t/j) in the sense that it doesn’t change when \^j) is changed 
by a small amount, Rayleigh’s theorem tells us that the number {%l)\H\ip) 
is an eigenvalue of H. Problem 9.10 gives a geometrical interpretation of 
Rayleigh’s theorem. 

The stationary point associated with the ground state is a minimum of 
(H). To see that this is so, we subtract the ground-state energy E 0 from 
both sides of equation (9.36) and have 


(H) -Eq = 


Ej kl 2 (£> - Eq) 
Ej Kl 2 


(9.40) 


Both the top and bottom of the fraction on the right are non-negative, so 
( H) > E 0 . The stationary points of ( H) associated with excited states are 
saddle points (Problem 9.14). 

The practical use of Rayleigh’s theorem is this. We write down a trial 
wavefunction ^> a (x) that depends on a number of parameters oi,...,ajv- 
These might, for example, be the coefficients in an expansion of ip a as a 
linear combination of some convenient basis functions iq(x) 


N 

V’a(x) = ^ djUi(x). (9.41) 

i—1 

More often the cq are parameters in a functional form that is motivated 
by some physical argument. For example, in Chapter 10 we will treat the 
variable Z that appears in the hydrogenic wavefunctions of §8.1.2 as one of 
the cq. Then we use to calculate ( H ) as a function of the <q and find 
the stationary points of this function. The minimum value of ( H) that we 
obtain in this way clearly provides an upper limit on the ground-state energy 
Eq. Moreover, since ( H) is stationary for the ground-state wavefunction, 
(. H) — Eq increases only quadratically in the difference between if> a and the 
ground-state wavefunction i/)q. Hence, with even a mediocre fit to ipQ this 
upper limit will lie close to Eq. This approach to finding eigenvalues and 
eigenfunctions of the Hamiltonian is called the variational principle. 

In Problems 9.12 and 9.13 you can explore how the variational principle 
works in a simple case. 


9.3 Time-dependent perturbation theory 

We now describe a way of obtaining approximate solutions to the tdse (2.26) 
that we shall use to study both scattering of particles and the emission and 
absorption of radiation by atoms and molecules. 

Consider the evolution of a system that is initially in a state that is 
nearly, but not quite, in a stationary state. Specifically, at t = 0 it is in the 
Nth eigenstate of a Hamiltonian Hq that differs by only a small, possibly 
time-dependent, operator V from the true Hamiltonian H: 

H = Hq + V. (9.42) 

Inspired by (2.32) we expand the solution to the tdse for this H in the form 

\^) = Y J a n{t)e- iE - t/h \E n ), (9.43) 


where \E n ) is a (time-independent) eigenket of Hq with eigenvalue E n . This 
expansion doesn’t restrict \4>) because {| E n )} is a complete set and the func¬ 
tions a n (t) are arbitrary. Substituting it into the tdse we have 


= ( H 0 + V)\i/>) = Y, (En\E n ) + V\E n )y n e~ iE ^ h 

n 

= ^ + E n a n )e~ iE ^ h \E n ). 


(9.44) 


n 
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We simplify this by multiplying through by (E k |: 

i ha k e-' lEkt/n = Y, a n e- iE ^ h (E k \V\E n ). (9.45) 

n 


This constitutes a set of linear ordinary differential equations for the a n (t) 
which must be solved subject to the boundary conditions o/v(0) = 1 and 
a n (0) = 0 for n ^ N. Hence, at the earliest times the term on the right of 
(9.45) with n — N will dominate the equation of motion of a k with k ^ N, 
and we have the approximation 


a k ~ -je-^ EN - Ek ^ n {E k \V\E N ). (9.46) 

We now assume that any time dependence of V takes the form V (t) = Voe la,t , 
where Vo is a time-independent operator. This assumption is in practice not 
very restrictive because the theory of Fourier analysis enables us to express 
any operator of the form Vof(t) , where / is an arbitrary function, as a linear 
combination of sinusoidally varying operators. Replacing V by Voe lwt in 
equation (9.46) and integrating from t = 0 we find 


a k {t) 


{E k \V 0 \E N ) 
En — E k — Tun 


e —i(E N —E k —hu)t '/ft 


(9.47) 


so the probability that after t the system has made the transition to the fcth 
eigenstate of H 0 is 


Pk(t) 


|afc | 2 

\{E k \V 0 \E N )\ 2 
(E n - E k - fun) 2 


^2 — 2 cos 


^ {En ^ E k - Tun)t ^ | 


d\(E k \V 0 \E N )\ 2 


sin 2 ((En — E k — Tiu>)t/2h) 
(En - E k - Tun) 2 


(9.48) 


For a time of order Tij (En — E k — Tun) this expression grows like t 2 . Subse¬ 
quently it oscillates. 


9.3.1 Fermi golden rule 

In many applications of equation (9.48), there are a large number of station¬ 
ary states of Ho with energies E k that lie within h/t of 

-E'out = En — Tun , (9.49) 


and we are interested in the probability that the system has made the tran¬ 
sition to any one of these states. Hence we sum the P k over k. Let there 
be g(E k )dE k eigenvalues in the interval (E k + d E k ,E k ). Then the total 
transition probability is 


Y p k(i) = 4 J dE k g(E k )\(E k \V\E N )\ 2 

k 


sin 2 ((E out - E k )t/2h) 
(E 0ut - E k ) 2 


^ J dxg(E out 


2 hx) | (E out - 2Tix\V\E n )\ 2 


(9.50) 


where we’ve introduced a new variable, x = (E ou t — E k )/2Ti. For given t, 
the function ft(x) = sin 2 (xt)/x 2 is dominated by a bump around the origin 
that is of height t 2 and width 27r/f. Hence, the area under the bump is 
proportional to t and in the limit of large t, 


sin 2 (xt) 
x 2 


oc td(x). 


(9.51) 
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We find the constant of proportionality by differentiating f dx ft. with respect 
to t: 


cl f°° sin 2 (xt) 

' dx-~— 


d t 


, sin(2xt) 
dx- = 7r, 


(9.52) 


where we have used a result, f dx sinx/x = 7r, from the theory of contour 
integration in the complex plane. Hence 

lim —-M = ntS(x). (9.53) 

t-> oo X 2 

Inserting this relation in (9.50) and integrating over x, we have finally 


^2 p k = g{Eout ) |(out|y|in)| 2 . (9.54) 


This equation establishes Fermi’s golden rule 5 of perturbation theory: a 
perturbation Ve lult causes a system to transition to a new state lower in 
energy by huj at a rate equal to 2 tt/Ti times the mod-square of the matrix 
element ofV between the initial and final states times the density of states 
at the final energy. It is easy to see that if the time-dependence of the 
perturbation were e~ lut , it would cause transitions at the same rate to states 
higher in energy by Tun. 


9.3.2 Radiative transition rates 

We now use equation (9.48) to calculate the rate at which a electromagnetic 
waves induce an atom to make radiative transitions between discrete sta¬ 
tionary states. Our treatment is valid when the quantum uncertainty in the 
electromagnetic field may be neglected, and the field treated as a classical 
object. This condition is satisfied, for example, in a laser, or at the focus of 
the antenna of a radio telescope. 

Whereas in our derivation of Fermi’s golden rule, we took the frequency 
in of the perturbation to be fixed and assumed a continuum of final states, 
now that we are considering the case of a discrete final state, we argue 
that the electromagnetic field is a superposition of plane waves of various 
frequencies, and that we should sum the transition probability (9.48) that 
each wave independently contributes. Thus we write 

W) = 4 £ |(E t |4K,|E„)|4 Sin2 ( g~ (9.55) 

™ ( E * ~ E *- M 2 



where SVq and in relate to an individual wave. 

In vacuo the electric field of an electromagnetic wave is divergence free, 
being entirely generated by Faraday’s law, VxE = —<9B/i 9t. It follows that 
the whole electromagnetic field of the wave can be described by the vector 
potential A through the equations 


B = V x A and E = -dA/dt. (9.56) 

We are considering a superposition of plane waves, which individually have 
a potential 

<5A(x, t) = t>A 0 cos(k • x — int), (9.57) 

where <5Ao is a constant vector and k is the wavevector. From equations 
(9.56) and (9.57) we have that the wave’s contribution to the electric field is 

(5E(x, t) = — u5Aq sin(k • x — int), (9.58) 

5 The golden rule was actually first given by P.A.M. Dirac, Proc. Roy. Soc. A, 114, 
243 (1927) 
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so £E is parallel to SAq. From V • (5E = 0 it follows that 


k • S Aq = 0 , 


(9.59) 


so k • (5E = 0 and the wave is transverse. 

In §9.1.3 we saw that an external electromagnetic field adds to an atom’s 
Hamiltonian the perturbing term (9.23) for each electron. In the present case 
the perturbation is 

SV (x, t ) = 2 _— (P ' <5A.q cos(k • x — uit ) + cos(k • x — uit)SAo • p}. (9.60) 


By virtue of equation (9.59), <5Ao-p commutes with kx because a component 
of momentum always commutes with a perpendicular component of position. 
Since <5Aq is a constant, it commutes with p. So we can simplify SV to 


SV (x, t) = — SA q • p cos(k • x — wt) 
m e 

= i ,5Aop ( e '‘ k 


x—uit) _|_ i(k-x— cot) 


(9.61) 


We now make the approximation that the electromagnetic wavelength 
is much bigger than the characteristic size of the atom or molecule. This is 
a good approximation providing the atom or molecule moves between states 
that are separated in energy by much less than am e c 2 (Problem 9.20), as 
will be the case for waves with frequencies that are less than those of soft 
X-rays. In this case we will have k-x<l for all locations x in the atom 
or molecule at which there is significant probability of finding an electron. 
When this condition is satisfied, it makes sense to expand the factors e ±lkx 
in equation (9.61) as a power series and discard all but the constant term. 
We then have 

SV(x, i) = ^- SA 0 • p (e“ iwt + e iwt ), (9.62) 

where we have retained the exponentials in time because large values of 
t cannot be excluded in the way that we can exclude large values of x. 
Finally, we note that in the gross-structure Hamiltonian Hq , p occurs only 
in the term p 2 /2m e , so [H 0 ,x\ = —i(h/m e )p. When we use this relation to 
eliminate p from equation (9.62), we have 

SV(x,t) = + e™% (9.63) 

where we have chosen to make the z axis parallel to Ao. Thus a plane elec¬ 
tromagnetic wave gives rise to perturbations with both positive and negative 
frequencies. Above we derived the frequency condition u> = (Em — Ek)/h 
for transitions from \En) to | Ek), so the negative frequency perturbation 
is associated with excitation of the system (Ek > E/v), while the positive 
frequency perturbation is associated with radiative decays. 

We identify the time-independent part of SV as the operator SVq that 
occurs in equation (9.55) and then have that the net transition probability 
is 




e2 \2\/v i r it jip \ |2 s in 2 ((En — Ek — Tiu})t/2K) 

- 2 ^(SA 0 ) \(E k \[H 0 ,z]\E N )\ - ^ M2 - 


e 

4ft 1 


(E k ~ E N f\{E k \z\E N )\ 2 ]T (M 0 ) 2 ^, 


x = (Em — Ek — hui)/2%. 


where 


(9.64a) 

(9.64b) 
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Even though the expression 


P = 2^ 0 {{E/C)2 + B2} (9 ' 65) 

for the energy density of an electromagnetic field is quadratic in the field 
amplitudes E and B, the volume-averaged energy density of a superposition 
of plane waves is just the sum of the energy densities of each individual 
wave. Moreover, the electric and magnetic energy densities of a plane wave 
are equal, so the energy density contributed by our plane wave is just twice 
its electric energy density, and from equations (9.58) and (9.65) we infer that 
our wave contributes the time-averaged energy density 

5p = ^ ^ SA °J = ±w 2 e 0 (<L4 0 ) 2 , (9.66) 

2p 0 c z 

where the second equality uses po c 2 = 1/eo- Using this expression to elimi¬ 
nate 5A 0 from equation (9.64a), we obtain 


E P ^) = -A~ E N f\{E k \z\E N )\ 2 (9-67) 

w waves 


Let p(ui) be the power contained in all waves that have frequencies less than 
oj. In symbols 

PM = E <W- (9-68) 

waves with uj ' <oj 


Then 





(9.69) 


When we use this expression to replace the sum on the right side of equation 
(9.67) by an integral, and we use equation (9.64b) to replace d to with — 2dx, 
we obtain 


E p *w 



(E k - E N ) 2 \{E k \z\E N )\ 2 


dx 


dp sin 2 (xi) 
dw lo 2 x 2 


(9.70) 


We now let t become large and exploit equation (9.53) to evaluate the inte¬ 
gral. The result is 


E P ^) 


= t—t\(E k \z\E N )\ 2 
e 0 n 


dp 

dx> 


ui=(E N — E k )/h 


(9.71) 


The coefficient of t on the right of this equation gives the rate R at which 
transitions occur. When we express the cumulative energy density of the 
wave-field in terms of frequency v rather than angular frequency oj and use 
equation to eliminate eo in favour of the Bohr radius, the rate becomes 

R=J^\(E k \z\E N )\ 2 ^ . (9.72) 

°0 m e v -\E N -E k \/h 

When E k > Ejy, the negative-frequency term in equation (9.62) gives 
rise to excitations at an identical rate. Thus we have recovered from a dynam¬ 
ical argument Einstein’s famous result that stimulated emission of photons 
occurs, and that the coefficient B that controls the rate of stimulated emis¬ 
sion is equal to the absorption coefficient (Box 9.2). Einstein’s prediction 
of stimulated emission led 38 years later to the demonstration of a maser 
(§5.2.1) and 44 years later to the construction of the first laser by Theodore 
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Box 9.2: Einstein A and B Coefficients 


In 1916, when only the merest fragments of quantum physics were known, 
Einstein showed ( Verh. Deutsch. Phys. Ges. 18 , 318) that systems must 
be capable of both spontaneous and stimulated emission of photons, and 
that the coefficient of stimulated emission must equal that for absorption 
of a photon. He obtained these results by requiring that in thermal 
equilibrium there are equal rates of absorption and emission of photons 
of a given frequency v by an ensemble of systems. He considered a 
frequency v for which hv = A E, the energy difference between two states 
|1) and 12) of the systems. The rate of absorptions he assumed to be 
A a bs = B a Ni(dp/dv), where B a is the absorption coefficient, N\ is the 
number of systems in the state 11), and (dp/dv) is the energy density 
in radiation of frequency v. The rate of emissions he assumed to be 
N em = B e N2{dp/dv) + AN2, where B e is the coefficient for induced 
emission and A is that for spontaneous emission. Equating N a b s to N em 
yields 

0 = {B e N 2 - B a NA^ + AN 2 . 

In thermal equilibrium Ni = N2e hv ^ kT and dpfdv is given by the Planck 
function. Using these relations to eliminate N\ and dp/dv and then 
cancelling IV2, we find 


0 = (H e 


B a e hv/kT ) 


87 rhu 3 

C 3 (e hv /kT — l) 


-A. 


In the limit of very large T, Q hv l kT —» 1, so the factor multiplying the 
bracket with the Bs becomes large, and the contents of this bracket tends 
to B e — B a . It follows that these coefficients must be equal. We therefore 
drop the subscripts on them, take B out of the bracket, cancel the factors 
with exponentials, and finally deduce that 

A = 8irh(u / c) 3 B . (1) 


Maiman. 6 In view of this history, it’s a remarkable fact that a laser operates 
in the regime in which the electromagnetic field can be treated as a classical 
object, as we have done here. Emission of light by a humble candle, by con¬ 
trast, is an inherently quantum-mechanical phenomenon because it occurs 
through spontaneous emission. Our treatment does not include spontaneous 
emission because we have neglected the quantum uncertainty in the elec¬ 
tromagnetic field. This uncertainty endows the field with zero-point energy 
(§3.1), and spontaneous emission can be thought of as emission stimulated 
by the zero-point energy of the electromagnetic field. 

Using the argument given in Box 9.2, Einstein was able to relate the 
coefficient A of spontaneous emission to B. Einstein’s argument does not 
yield a numerical value for either A or B. Our quantum mechanical treatment 
has yielded a value for B , and with Einstein’s relation (eq. 1 in Box 9.2) 
between B and A we can infer the value of A: 

16 TT 2 hlS 3 ,o 

- (E k zE N ) 2 . 9.73 

c 6 aom e 

From this we can estimate the typical lifetime for radiative decay from an 
excited state of an atom. 

When the radiation density p is very small, the number N2 of atoms in 
an excited state obeys N 2 = —AN 2 (Box 9.2), so N 2 decays exponentially 

6 The word ‘laser’ is an acronym for “light amplification by stimulated emission”. 
Curiously Maiman’s paper (Nature , 187, 493 (I960)) about his laser was rejected by the 
Physical Review. 
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with a characteristic time A _1 . Unless some symmetry condition causes 
the matrix element in equation (9.73) to vanish, we expect the value of the 
matrix element to be ~ do- So the characteristic radiative lifetime of a state 
is 


- A -i - m e c2 A 1 

hv do 167r 2 ^ 


(9.74) 


For an optical transition, hv ~ 2eV, A ~ 650 nrn ~ 1.2 x 10 4 ao, and v ~ 
4.6 x 10 14 Hz, so r ~ 4 x 10 _8 s. It follows that ~ 10 7 oscillations of the 
atom occur before the radiation of energy causes the atom to slump into the 
lower state. 


9.3.3 Selection rules 

Equation (9.72) states that the rate of radiative transitions is proportional 
to the mod-square of the electric dipole operator ez. For this reason the 
approximation we made, that k • x <C 1, is called the electric dipole ap¬ 
proximation. 

There are important circumstances in which symmetry causes the matrix 
element of the dipole operator to vanish between the initial and final states. 
Transitions between such states are said to be forbidden in contrast to al¬ 
lowed transitions, for which the matrix element does not vanish. Some 
approximations were involved in our derivation of equation (9.72), so the 
transition rate does not necessarily vanish completely when the matrix ele¬ 
ment is zero. In fact, forbidden transitions often do occur, but at rates that 
are much smaller than the characteristic rate of allowed transitions (eq. 9.74) 
because the rate of a forbidden transition is proportional to terms that we 
could neglect in the derivation of equation (9.72). We now investigate rela¬ 
tions between the initial and final states that must be satisfied if the states 
are to be connected by an allowed transition. Such relations are called se¬ 
lection rules. The slower rate of forbidden transitions must be determined 
by either including the next term of the Taylor expansion of e lk x , or taking 
into account the perturbation /tbS • B that arises from the interaction of the 
intrinsic magnetic moment of an electron with the wave’s magnetic field. 

We are interested in matrix elements between states that are eigenstates 
of operators that commute with the Hamiltonian H that the atom would 
have if it were decoupled from electromagnetic waves. The Hamiltonian 
should include spin-orbit coupling as well as interaction with whatever steady 
external electric or magnetic fields are being applied. The operator in the 
matrix element is the component of the position operator parallel to the 
electric field of the radiation that is being either absorbed or emitted. 

Even in the presence of an external field, the angular-momentum parallel 
to the field, which we may call J z , commutes with H , so the kets of interest 
are labelled with m. Since [J z ,z] = 0, the ket z\E,m) is an eigenket of J z 
with eigenvalue m. It follows that (E, m\z\E', m') = 0 unless m = m!. This 
gives us the first selection rule listed in Table 9.1, namely that when the 
electric vector of the radiation is parallel to the imposed field, the quantum 
number m is unchanged by radiation. 

If we define x± = x ± iy, we have 

[J z ,x±] = h/± i(—hr) = ±x±. (9.75) 

It follows that x±\E,m) is an eigenket of J z with eigenvalue to ± 1, so 

(E,m\x\E',rri) = \{E, m\(x+ + X-)\E', m!) 

= 0 unless m! = m ± 1. 

Obviously the same result applies to the matrix element for y. Hence we 
have the second selection rule listed in Table 9.1: when the electric vector of 
the radiation is perpendicular to the imposed field, the quantum number to. 
changes by ±1. If the direction of observation is along the imposed field, the 
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Table 9.1 Selection rules 
j | j - j'\ < 1 but j = 0 t4 f = 0 

to |m — m’ | < 1; to = m! for E parallel to an external field; 

|m — to'| = 1 for photon emitted parallel to an external field 
l |2 — Z'| = 1 

s s = s' 

electric vector of the radiation must be perpendicular to the field. Hence, 
in this case to must change by ±1. In fact, to increases when a left-hand 
circularly-polarised photon is emitted in the positive 2 direction, and con¬ 
versely for the emission of a right-hand polarised photon. When the direction 
of observation is perpendicular to the imposed field, the electric vector of the 
radiation can be either perpendicular to the held, in which case to changes 
by ±1, or parallel to the held, and then m does not change. 

When there is no imposed held, to may be unchanged or change by ±1, 
and we can observe photons associated with any of these changes in to when 
observing from any direction. 

When there is no imposed held, J 2 commutes with H and the kets 
of interest are labelled with E, j and to. The selection rule for j can be 
obtained from the rules for adding angular momentum that were discussed 
in §7.5: (E, j, m\xk\E', j',m') vanishes unless it is possible to make a spin-j 
object by adding a spin-one object to a spin-j' object. For example, the 
matrix element vanishes if j = j' = 0 because spin-one is all you can get by 
adding a spin-one system to a spin-zero one. Subject to the selection rules on 
to just discussed, the matrix element does not vanish if j = 0 and j' = 1, or if 
j = 1 and j' = 1, because both a spin-zero system and a spin-one one can be 
obtained by adding two spin-one subsystems. The matrix element vanishes 
if j = 1 and j' = 3 because by adding spin-one and spin-three the smallest 
spin you can get is spin-two. In summary, the selection rule is | j — j'\ < 1 
except that j = 0 —> j’ = 0 is forbidden. 

The selection rules for j that we have just given follow from a powerful 
result of group theory, the Wigner-Eckart theorem. Unfortunately, a 
signihcant amount of group theory is required to prove this theorem. In 
Appendix J we give a proof of the selection rule for j that builds on the 
calculation involved in Problem 7.21. 

When spin-orbit coupling is weak, the total orbital angular momentum 
L 2 and the total spin angular momentum S 2 are constants of motion, so 
their quantum numbers l and s are likely to appear as labels in the kets. 
Since [x, S] = 0, it is clear that the selection rule for s is that it should not 
change. The selection rule for l was derived in Problem 7.21: \l — V | = 1. 

Problems 

9.1 A harmonic oscillator with mass to and angular frequency oj is per¬ 
turbed by 6H = ex 2 , (a) What is the exact change in the ground-state 
energy? Expand this change in powers of e up to order e 2 . (b) Show that 
the change given by first-order perturbation theory agrees with the exact 
result to 0(e) (c) Show that the first-order change in the ground state is 
| b) = —(e £ 2 /y/2hio)\E 2 )■ (d) Show that second-order perturbation theory 
yields energy change E c = —e 2 Ti/Am 2 ix 3 in agreement with the exact result. 

9.2 The harmonic oscillator of Problem 9.1 is perturbed by SH = ex. Show 
that the perturbed Hamiltonian can be written 

where X = x+e/mto 2 and hence deduce the exact change in the ground-state 
energy. Interpret these results physically. 
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What value does first-order perturbation theory give? From perturba¬ 
tion theory determine the coefficient b± of the unperturbed first-excited state 
in the perturbed ground state. Discuss your result in relation to the exact 
ground state of the perturbed oscillator. 

9.3 The harmonic oscillator of Problem 9.1 is perturbed by SH = ex A . 
Show that the first-order change in the energy of the n th excited state is 

SE = 3(2 n 2 + 2n + l)e (. (9.77) 

\2mco J 

Hint: express x in terms of A + A*. 

9.4 The infinite square-well potential V(x) = 0 for |a?| < a and oo for 
\x\ > a is perturbed by the potential SV = ex I a. Show that to first order in 
e the energy levels of a particle of mass m are unchanged. Show that even 
to this order the ground-state wavefunction is changed to 

i i ^ 

W*) = 7^ C0S (W 2 °) + n 2 El ^ a E (- 1 ) n/2 (n 2_ 1) 3 Mmrx/2a), 


where E\ is the ground-state energy. Explain physically why this wavefunc¬ 
tion does not have well-defined parity but predicts that the particle is more 
likely to be found on one side of the origin than the other. State with rea¬ 
sons but without further calculation whether the second-order change in the 
ground-state energy will be positive or negative. 

9.5 An atomic nucleus has a finite size, and inside it the electrostatic 
potential <f>(r) deviates from Ze/(Alter). Take the proton’s radius to be 
a p ~ 10 -15 m and its charge density to be uniform. Then treating the dif¬ 
ference between $ and Ze/(Aite^r) to be a perturbation on the Hamiltonian 
of hydrogen, calculate the first-order change in the ground-state energy of 
hydrogen. Why is the change in the energy of any P state extremely small? 
Comment on how the magnitude of this energy shift varies with Z in hydro- 
genic ions of charge Z. Hint: exploit the large difference between a p and ao 
to approximate the integral you formally require. 

9.6 Evaluate the Lande g factor for the case l = 1 , s = \ and relate your 
result to Figure 9.2. 

9.7 A particle of mass m moves in the potential V(x,y) = ^ mix 2 (x 2 + y 2 ), 
where u is a constant. Show that the Hamiltonian can be written as the 
sum H x + H y of the Hamiltonians of two identical one-dinrensional harmonic 
oscillators. Write down the particle’s energy spectrum. Write down kets 
for two stationary states in the first-excited level in terms of the stationary 
states | n x ) of H x and \n y ) of H y . Show that the n th excited level is n + 1 
fold degenerate. 

The oscillator is disturbed by a small potential H\ = Xxy. Show that 
this perturbation lifts the degeneracy of the first excited level, producing 
states with energies 2 hto ± Xh/2muj. Give expressions for the corresponding 
kets. 

The mirror operator M is defined such that for any state \tp), (x, y\M\ i/j) = 
(y,x\i/j). Explain physically the relationship between the states | i/j) and 
M\tp). Show that [M, Hi] = 0. Show that MH X = H y M and thus that 
[M, H] = 0. What do you infer from these commutation relations? 

9.8* The Hamiltonian of a two-state system can be written 


H = 


( A l +B l e 
\ B 2 e 



(9.78) 


where all quantities are real and e is a small parameter. To first order in e, 
what are the allowed energies in the cases (a) A i ^ A 2 , and (b) Ai = A 2 1 
Obtain the exact eigenvalues and recover the results of perturbation 
theory by expanding in powers of e. 
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Figure 9.3 The relation of input 
and output vectors of a 2 x 2 Hermi- 
tian matrix with positive eigenvalues 
Ai > A 2 . An input vector (A, Y) on 
the unit circle produces the output 
vector ( x , y) that lies on the ellipse 
that has the eigenvalues as semi¬ 
axes. 


9.9* For the P states of hydrogen, obtain the shift in energy caused by a 
weak magnetic field (a) by evaluating the Lande g factor, and (b) by use 
equation (9.28) and the Clebsch-Gordan coefficients calculated in §7.5.2. 

9.10 The 2 x 2 Hermitian matrix H has positive eigenvalues Ai > A 2 . The 
vectors (X, Y) and (x, y) are related by 


H 




Show that the points (AiX, A 2 Y) and (x, y) are related as shown in Figure 9.3. 
How does this result generalise to 3 x 3 matrices? Explain the relation of 
Rayleigh’s theorem to this result. 

9.11 We find an upper limit on the ground-state energy of the harmonic 
oscillator from the trial wavefunction i[>(x) = (a 2 + a: 2 ) -1 . Using the substi¬ 
tution x = a tan 9, or otherwise, show that 

pOO fOO pOO 

/ da’|f/'| 2 = ^7ra _3 / dxx 2 \i/j\ 2 = ra _1 / dxi/j*p 2 i/j = g7ra -5 

Jo Jo Jo 

(9.79) 

Hence show that (ip\H\ip) / is minimised by setting a = 2*/ 4 f', where t is 
the characteristic length of the oscillator. Show that our upper limit on Eq 
is Tiuj/\/2. Plot the the final trial wavefunction and the actual ground-state 
wavefunction and (a) say whether you consider it a good fit, and (b) how it 
might be adapted into a better trial wavefunction. 

9.12 Show that for the unnormalised spherically-symmetric wavefunction 
ip(r) the expectation value of the gross-structure Hamiltonian of hydrogen is 


(H) = 


2 m e 


dr 1 


dijj 


dr 


47re 0 


dr r |'i/’| ^ 


For the trial wavefunction ^ = e br show that 

= Ai 

2 m e 47reo ’ 



(9.80) 


and hence recover the definitions of the Bohr radius and the Rydberg con¬ 
stant. 

9.13* Using the result proved in Problem 9.12, show that the trial wave- 
function ipt = e~ b r ! 2 yields —8/(3tt)1Z as an estimate of hydrogen’s ground- 
state energy, where 1Z is the Rydberg constant. 

9.14 Show that the stationary point of (ip\H\il)) associated with an excited 
state of H is a saddle point. Hint: consider the state \ip) = cos6\k) +sin0|Z), 
where 9 is a parameter. 
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9.15 At early times (t ~ — oo) a harmonic oscillator of mass m and natural 
angular frequency u> is in its ground state. A perturbation SH = £xe~ l E 
is then applied, where £ and r are constants. 

a. What is the probability according to first-order theory that by late times 
the oscillator transitions to its second excited state, |2)? 

b. Show that to first order in 5H the probability that the oscillator tran¬ 
sitions to the first excited state, 11}, is 


P = 


7 t£ 2 t 2 
2mTiu> 


2 /2 


(9.81) 


c. Plot P as a function of r and comment on its behaviour as lot —> 0 and 

LOT —> OO. 

9.16 A particle of mass m executes simple harmonic motion at angular 
frequency to. Initially it is in its ground state but from t = 0 its motion is 
disturbed by a steady force F. Show that at time t > 0 and to first order in 
F the state is 

|V>, t) = e~ iEot/R \0) + ai e-' lElt/n \l) 


where 


ai = 


\/2 mhio 



Calculate (x) ( t ) and show that your expression coincides with the classical 
solution 

x(t)= [ d t'F{t')G(t-t'), 

Jo 

where the Green’s function is G{t — t') = sin[w(< — t')\/rruo. Show that a 
suitable displacement of the point to which the oscillator’s spring is anchored 
could give rise to the perturbation. 


9.17* A particle of mass m is initially trapped by the well with potential 
V(x) = — Vgd(x), where Vs > 0. From t = 0 it is disturbed by the time- 
dependent potential v(x,t) = —Fxe~ lult . Its subsequent wavefunction can 
be written 


| ip)=a(t)e l£ Wfi|o) + J dk { b k (t)\k,e) + c k (t)\k,o)}e lEfct/n , (9.82) 


where E 0 is the energy of the bound state |0) and Ek = h 2 k 2 /2m and |fc, e) 
and |fc, o) are, respectively the even- and odd-parity states of energy Ek (see 
Problem 5.17). Obtain the equations of motion 


ifr ja|0)e lEot / h + J d k (bk\k, e) + Ck\k, o)j e lEkt / h j 

= v |a|0)e _1 ' Eot / ?i + J dk (b k \k,e) + c k \k,o))e- lEkt/h 


(9.83) 


Given that the free states are normalised such that (fc',o|fc, o) = 6(k — k'), 
show that to first order in v, b k = 0 for all t, and that 


i F 

Ck(t) = — (fc,oM0)e 
n 


ifi fc t /2 sin(£lkt/2) 

n k /2 '■ 


u r. Ek ~ E ° 

where ilk = -t -w. 


(9.84) 

Hence show that at late times the probability that the particle has become 

f ree jg 

P„(t) = 2 ""f 2< l(t ’* |0>|2 . (9.85) 

h k n k =o 



224 


Problems 


Given that from Problem 5.17 we have 

(x|0) = y/Ke~ w \ x \ where K = — 6 and {x\k,o) = —^-sin(fcx), 

h \/TT 

(9.86) 

show that _ 

/ K 4k K 

~ (fc 2 + K 2 ) 2 ' (9 ' 87) 

Hence show that the probability of becoming free is 


„ ^ 8 hFH vWN 

il[ ) mEl (1 + E t /\E 0 \r 


(9.88) 


where Ef > 0 is the final energy. Check that this expression for Pf r is 
dimensionless and give a physical explanation of the general form of the 
energy-dependence of Pf r (t) 

9.18* A particle travelling with momentum p = Tik > 0 from — oo encoun¬ 
ters the steep-sided potential well V(x) = —Vo < 0 for |x| < a. Use the Fermi 
golden rule to show that the probability that a particle will be reflected by 
the well is 

V 2 

^reflect — ~j~J72 (2/to) . 

where E = p 2 /2m. Show that in the limit E Vo this result is consistent 
with the exact reflection probability derived in Problem 5.10. Hint: adopt 
periodic boundary conditions so the wavefunctions of the in and out states 
can be normalised. 

9.19* Show that the number states g{E) d E d 2 H with energy in (E, E+dE) 
and momentum in the solid angle d 2 0 around p = Kk of a particle of mass 
to that moves freely subject to periodic boundary conditions on the walls of 
a cubical box of side length L is 


g{E) dEd 2 n 



!^V2EdEdn 2 . 

h 3 


(9.89) 


Hence show from Fermi’s golden rule that the cross section for elastic scat¬ 
tering of such particles by a weak potential V (x) from momentum hk. into 
the solid angle d 2 0 around momentum hk' is 


der 


TO 


(2tt) 2 H 4 


d 2 n 


d 3 xe i( k - k ')' x v(x) 


2 


(9.90) 


Explain in what sense the potential has to be “weak” for this Born approx¬ 
imation to the scattering cross section to be valid. 

9.20 Given that ao = h/(am e c) show that the product agk of the Bohr 
radius and the wavenumber of a photon of energy E satisfies 


agk 


E 

am e c 2 


(9.91) 


Hence show that the wavenumber k a of an Hcc photon satisfies ao k a = 
and determine \ a /aQ. What is the connection between this result and our 
estimate that ~ 10' oscillations are required to complete a radiative decay. 
Does it imply anything about the way the widths of spectral lines from 
allowed atomic transitions varies with frequency? 

9.21 Equation (9.75) implies that x± act as ladder operators for J z . Why 
did we not use these operators in §7.1? 
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9.22 Given that a system’s Hamiltonian is of the form 

H = ik + V ^ < 9 - 92 > 

show that [x, [if, x\] = Ti 2 /m e . By taking the expectation value of this ex¬ 
pression in the state |fc), show that 


J2\(n\x\k)\ 2 (E n -E k ) 

n^k 


n 2 

2 m e ’ 


(9.93) 


where the sum runs over all the other stationary states. 

The oscillator strength of a radiative transition |fc) —► |n) is defined 
to be 

Orrj 

fkn = ^{E n - E k )\(n\x\k)\ 2 (9.94) 

Show that Y^ n fkn = 1. What is the significance of oscillator strengths for 
the allowed radiative transition rates of atoms? 
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Helium and the periodic table 


In this chapter we build on the foundations laid by our study of hydrogen in 
Chapter 8 to understand how the atoms of heavier elements work. Most of 
the essential ideas already emerged in Chapter 8. In fact, only one impor¬ 
tant point of principle needs to be introduced before we can move down the 
periodic table understanding why elements have the spectra and chemistry 
that they do. This outstanding issue is the remarkable implications that 
quantum mechanics carries for the way in which identical particles interact 
with one another. We shall be concerned with the case in which the parti¬ 
cles are electrons, but the principles we elucidate apply much more widely, 
for example, to the dynamics of the three quarks that make up a proton or 
neutron, or the two oxygen atoms that comprise an oxygen molecule. 


10.1 Identical particles 

Consider a system that contains two identical spinless particles. Then a 
complete set of amplitudes is given by a function ip of the coordinates x and 
x' of the particles: the complex number ipfx., x') is the amplitude to find one 
particle at x and the other particle at x'. What’s the physical interpretation 
of the number ipfx !, x)? It also gives the amplitude to find particles at x and 
x'. If the particles were not identical if one where a pion the other a kaon, 
for example - finding the pion at x and the kaon at x' would be a physically 
distinct situation from finding the kaon at x and the pion at x'. But if 
both particles are pions, ip(x,x.') and ip(x',x) are amplitudes for identical 
physical situations. Does it follow that ip(x,x') = ip(x',x)'? No, because 
experimentally we can only test the probabilities to which these amplitudes 
give rise. So we can only safely argue that the probability ^(x, x')| 2 must 
equal the probability (^(x', x)| 2 , or equivalently that 

^(x,x') = e I ^(x',x), (10.1) 

where 0 is a real number. This equation must hold for any x and x'. So the 
function ip has the property that if you swap its arguments, you increment 
its phase by <p . Specifically 


ip(x\x) = e'^ip(x } x'). 


(10.2) 
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Substituting this equation into the right side of equation (10.1), it follows 
that 

ip(x, x') = e 2l ^tp(x, x'), (10.3) 

which implies that either <p = 0 or (p = tt. Thus we have shown that the 
wavefunction for a system of two identical spinless particles has to be either 
symmetric or antisymmetric under interchange of the particles’ coordinates. 

Consider now the case of two spin-s particles, which might, for example, 
be electrons (s = or photons (s = 1). A complete set of amplitudes would 
be the amplitude for one particle to be at x in the state that has eigenvalue 
m of S z , and the other particle to be at x' with eigenvalue m!. Let the 
complex number V’mm^x, x') denote this amplitude - that is, let the first 
subscript on ip give the orientation of the spin of the particle that is found 
at the position given by the first argument of ip. Then the possibly different 
amplitude ip m 'm(x', x) is for the identical physical situation. Hence 

i'm(x',x) = e 1 ^_-(x,x'). (10.4) 

This equation must hold for all m, m! and x, x'. So swapping the subscripts 
on ip at the same time as swapping the arguments, is equivalent to multiply¬ 
ing through by e'A Swapping a second time leads to 

ip mm >(x,x') =e 2l Vmm'(x,x'), (10.5) 


so, as in the case of spin-zero particles, either (p = 0 or <p = 7r. It turns out 
that there is no change of sign if the particles are bosons (s = 0,1, 2 ,...), and 
there is a change of sign if the particles are fermions (s = ..). That is 


'Ipmm' (x, X ) 


—V , m'm(x , ,x) for fermions 
(x / , x) for bosons. 


( 10 . 6 ) 


These relations between amplitudes are said to arise from the principle of 
exchange symmetry between identical particles. 

Generalisation to the case of N identical particles If our system 
contains N identical fermions, the wavefunction will change its sign when we 
swap the arguments (both spin quantum numbers and spatial coordinates) 
associated with any two slots in the wavefunction. Similarly, if the system 
contains N bosons, the wavefunction will be invariant when we swap the 
arguments associated with any two slots. 


10.1.1 Pauli exclusion principle 

An immediate consequence of the wavefunction of fermions being antisym¬ 
metric under a swap of its arguments, is that there is zero probability of 
finding two fermions with their spins oriented in the same way at the same 
location: since ^ mm (x, x) = —ip mm (x,x), the amplitude ^ mm (x, x) must 
vanish. Since wavefunctions are continuous functions of position, and their 
spatial derivatives are constrained in magnitude by the particles’ momenta, 
i/) mro (x, x') can vanish at x = x' only if it is small whenever the two argu¬ 
ments are nearly equal. Hence, fermions with similarly oriented spins avoid 
each other; they are anticorrelated. This fact has profound implications 
for atomic and condensed-matter physics. 

If the particles’ spins have different orientations, there can be a non¬ 
zero amplitude of finding them at the same location: from ip mm >(x,x) = 
— ipm'm (x, x) it does not follow that the amplitude ip mm '(x,x) vanishes. 

The principle of exchange symmetry arises as a constraint on amplitudes, 
but we now show that it has implications for the structure of the underlying 
states. Let (|?i)} be a complete set of states for a single fermion - so the 
label n carries information about both the electron’s motion in space and 
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its orientation (spin state). Then from §6.1 we have that any state of an 
electron pair can be expanded in the form 1 

W=£w|»)|n'>- (10.7) 

nn' 


Multiplying through by (x,x', m, m'\ we obtain the amplitude x') 

that is constrained by exchange symmetry: 

VW'(x,x') = (x,x / ,TO,TO , |'i/’) = ^ (x, m\n) (x', m!\n!) . (10.8) 

nn' 


We now swap x, to with x',m' throughout the equation and add the result 
to our existing equation. Then by exchange symmetry the left side vanishes, 
and we have 

0 = ’^2 a nri (x, to| n) (x' ,m'\n') + ^ <w(x', m'|n)(x, ra|n'). (10.9) 

nn' nn' 


In the second sum we may swap the labels n and n! (since they are be¬ 
ing summed over), and we may also reverse the order of the amplitudes 
(x', m'\n') and (x, m\n) (because they are mere complex numbers). Then we 
have 


0 = ^<w(x, m\n)(x' ,m'\n') + ^ a n » n (x, m\ri)(x', m'\n') 

nn' n'n 

= ^(x,?n|n)(x',TO , |n / )(a rm / +a„/„) 

nn' 

= (x,x',to,to'| ^ |n)|n')(<w + a„/„). 

nn' 


( 10 . 10 ) 


Since this equation holds for arbitrary x, x', m, m! it follows that the sum 
vanishes, and from the linear independence of the basis kets |n}|n') it follows 
that the coefficient of each such ket vanishes. Hence we have 


®nn' — ®n'n* 


( 10 . 11 ) 


In particular a nn vanishes so there is zero amplitude to find that both 
fermions are in the same single-particle state |n). This result is known as 

the Pauli exclusion principle. 

The Pauli exclusion principle ensures that any expansion of the form 
(10.7) involves at least two terms. When there are only two terms, equation 
(10.11) ensures that a nn > = —a n ’ n = ±l/\/2, so equation (10.7) reduces to 


= ±^(|n)|n')-|n»). 


( 10 . 12 ) 


In §6.1 we saw when the wavefunction of a pair of particles is a non-trivial sum 
over products of wavefunctions for each particle, the particles are correlated. 
Hence the Pauli exclusion principle implies that identical fermions are always 
correlated. 


1 Equation (10.7) implies that we can distinguish the two electrons - electron “1” is 
in the state |n), while electron “2” is in state | n'). Physically this is meaningless. What 
we are doing is writing down states of distinguishable particles that are consistent with 
the restrictions imposed on states of pairs of indistinguishable particles. 
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10.1.2 Electron pairs 

We now specialise to the important case of identical spin-half particles, such 
as electrons. For complete specification of the quantum state of a single 
electron we must give the values of two functions of x, which can be the 
amplitudes ip± ( x ) to find the electron at x and oriented such that S z returns 
±1. These functions form a two-component wavefunction: 


( X IV>) 


( V>+( x )\ 
VV’-(x)/ ' 


(10.13) 


Similarly, to specify completely the state of a pair of electrons, four functions 
of two locations are required, namely for to, to' = ±. Thus an 

electron pair has a four-component wavefunction 


/VMh-( x , x ')\ 

^-+( x , x ') I 
t4-( x , x ') 

W-( x , x ')/ 


(10.14) 


We often wish to consider the states of an electron pair in which it is the pair 
rather than its individual members that has well-defined spin in §7.5.1 we 
investigated the states of a hydrogen atom in which the atom rather than 
its constituent particles has well-defined spin. Our derivation of the results 
obtained there relied only on the properties of the spin raising and lowering 
operators S ±, so they are valid for any pair of spin-half particles, including 
an electron pair. Multiplying equation (7.153) through by (to, to' | we see 
that when the pair has unit spin and S z yields 1, the only non-vanishing 
amplitude 'i/’mm' is '*/’++• Hence in this state of the pair the wavefunction is 


< x , x'lV', 1,1> 


V>(( x , x ') 



(10.15) 


where i/>l = ip++ is required by exchange symmetry to be an antisymmetric 
function of x and x': 


i( x ', x ) = -^i( x , x ')- 


(10.16) 


Similarly, from equation (7.154) it follows that when the pair has s = 1 
but to = 0 , there are equal amplitudes to find the individual spins —I- and 


H—, so 


< x , x 'IV’, i,o) 


V’iC^x') 



(10.17) 


where ip °(x, x') = ij) _|_(x, x') = if). |_(x, x'). Swapping the labels x and x' 

on both sides of the equivalence and using first equation (10.6) and then 
equation (10.17), yields 


^i( x ', x ) = V’-+( x ', x ) = -V’+-( x , x ') = -V’i( x , x ')- (10.18) 


Thus ipi like ip\ is an antisymmetric function of its arguments. 

Similarly, from equation (7.155) we infer that when the pair has s = 1 
and to = — 1 its wavefunction is 


/° 

0 

0 

Vi 


( x , x '|^, 1,-1) = V>1 ^x.x') 


(10.19) 
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where ip^ 1 is an antisymmetric function of its arguments. 

Finally we must consider the spin-zero state of the pair. By equation 

(7.156) it is associated with values of the amplitudes tp _|_ and i /q_that are 

equal in magnitude and opposite in sign. So we can write 


(x,x , |V’,0,0) = ipopx,*') 



( 10 . 20 ) 


In this case use of the exchange principle yields 

^o(x,x') ee ^_+(x,x') = -Vm— (x',x) = ^ 0 (x',x) (10.21) 

so ipo, in contrast to the previous functions ip™, is a symmetric function of 
its arguments. 

The spin-one states of an electron pair are generally called triplet 
states while the spin-zero state is called the singlet state. We have seen 
that the wavefunction of a triplet state is an antisymmetric function of x 
and x', while wavefunction of the singlet state is a symmetric function of x 
and x'. We saw above that electrons that have equal components of angular 
momentum parallel to the z axis avoid each other. We now see that this 
mutual avoidance is a general characteristic of all the triplet states. 

One way of constructing a function of two variables is to take the product 
u(x)v(x') of two functions u and v of one variable. Unless u = v, this product 
is neither symmetric nor antisymmetric under interchange of x and x', so it 
cannot be proportional to the wavefunction of either a triplet or a singlet 
state. To achieve such proportionality, we must extract the symmetric or 
antisymmetric part of the product. That is, for appropriate u and v we may 
have 


Vh m (x,x') 


V2 


{m(x)u(x') 


V>o(x,x') 


- u(x')v(x)} (to =1,0, — 1) 
-^{u(x)u(x') + u(x>(x)}. 


( 10 . 22 ) 


In the case u = v, the triplet wavefunctions are identically zero but the sin¬ 
glet wavefunction can be non-vanishing; that is, two distinct single-particle 
wavefunctions are required for the construction of a triplet state, while just 
one single-particle wavefunction is all that is required for a singlet state. 

Wavefunctions of the form (10.22) are widely used in atomic physics 
but one should be clear that it is an approximation to assume that a two- 
electron wavefunction can be written in terms of just two single-particle 
wavefunctions; any wavefunction can be expanded as a sum of products of 
single-particle wavefunctions, but the sum will generally contain more than 
two terms. 


10.2 Gross structure of helium 

About a quarter of the ordinary matter in the Universe is in the form of 
helium, the second simplest element. The tools that we now have at our 
disposal enable us to build a fairly detailed model of these important atoms. 
This model will illustrate principles that apply in all many-electron atoms. 

We seek the stationary states of the Hamiltonian that describes the 
electrostatic interactions between the two electrons and the alpha particle 
that make up a helium atom. This Hamiltonian is (cf. eq. 8.1) 


H = 


P n 


Pi 


P\ 


2m n 2 TO e 2?7l e 47T£o 


Xl - x n 


X 2 - X n 


Xl - X 2 


(10.23) 
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where x,; and x n are the position operators of the i th electron and the nucleus, 
respectively, and Pi and p n are the corresponding momentum operators. We 
shall work in the atom’s centre-of-mass frame and neglect the small displace¬ 
ment from the origin and kinetic energy that the nucleus has in this frame. 
With this approximation, H can be written as the sum of two hydrogenic 
Hamiltonians with Z = 2 (cf. eq. 8.10) and the term that describes the 
mutual electrostatic repulsion of the electrons 


H = ff H (pi,xi) + Hh(P 2 ,x 2 ) + --j—-,, (10.24a) 

47re 0 | x i ~ x 2 | 


where 


#h(p, x ) 



2 e 2 

47 re 0 |x|' 


(10.24b) 


We cannot determine the eigenkets of H exactly, so we resort to the approx¬ 
imate methods developed in the previous chapter. 


10.2.1 Gross structure from perturbation theory 

Our first approach is to use the perturbation theory of §9.1. In §8.1 we found 
the eigenfunctions of H^. These proved to be products u l n {r)Y] n (9 1 </>) of the 
radial eigenfunctions u l n derived in §8.1.2 and the spherical harmonics 
derived in §7.2.3. From the work of §6.1 it follows that the eigenfunctions of 
the operator 

Flo = U H (pi,x 1 ) + H r H (P 2 ,x 2 ) (10.25) 

are products 


^( Xl )^;,(x 2 ) = ^(r 1 )Yr(0i,^)^(r 2 )Y i T'(0 2 ,^ 2 ), 


(10.26) 


where n and n' are any positive integers. From equation (8.27) the corre¬ 
sponding eigenvalues are 


Eq — — 47 ^ 



+ 



(10.27) 


The ground-state wavefunction of Hq will be a product of the ground-state 
eigenfunctions (47r) _1 / 2 u5(r) of Hu, where the function U® is given by equa¬ 
tion (8.36) with Z = 2. From equation (10.27) the ground-state energy of 
H 0 is 

E 0 = -8K = —108.8 eV. (10.28) 

The Hamiltonian (10.23) commutes with all spin operators because it 
makes no reference to spin. Therefore we are at liberty to seek eigenfunctions 
of H that are simultaneously eigenfunctions of the total spin operators S 2 and 
S z . We have seen that these eigenfunctions are either singlet or triplet states 
and are either symmetric or antisymmetric functions of the spatial variables. 
The ground-state wavefunction of Hq is an inherently a symmetric function 
of xi and x 2 , so the ground-state is a singlet. The first-order contribution 
to the ground-state energy is the expectation value of the perturbing part of 
the Hamiltonian (10.24.) This expectation value is 


A E = 


47re 0 


Dq where Dq= d^xi d ,i x 2 


l^l0( x l)| 2 |^10( x 2)| S 

I x l - x 2 | 


(10.29) 


Box 10.1 describes the evaluation of the six-dimensional integral Dq. We 
find that A E = 1 1Z , so our estimate of the ground-state energy of helium is 
E = E 0 + A E = -^-IZ = — 74.8eV. The experimentally measured value is 
-79.0 eV. 
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Box 10.1: Evaluating the 
integral Dq in equation (10.29) 

We express the two position vectors in spherical polar coordinates. Since 
xi is a fixed vector during the integration over X 2 , we are at liberty 
to orient our z axis for the X 2 coordinate system parallel to xi. Then 
|xi — X 2 I = \/l r i + r 2 — 2rir2 cos $ 2 ! is independent of fa. The mod 
square of 'FJq does not depend on <f>, so the integrand is independent of 
fa and we can trivially integrate over fa. What remains is 


D 0 = -T- / d 3 X! |^ 0 ( Xl )| 2 / dr 2 d6» 2 - 


where az = ao/2. Now 


l r i + r\- 2nr 2 cos 62 ] 


_1_d_ 

nr 2 dd 2 


r\ sin 0 2 e _2r2 ^“ z 

/\r\ + r| — 2r!r 2 cos# 2 | 

\J\r\ +r% - 2rir 2 cos0 2 |, 
\n +r 2 | - In. - r 2 | 


f* sin 6*2 d 0 2 _ |n + r 2 | — |n - r 2 | 

Jo i/| r\ + r% — 2rir 2 cos0 2 | r i r 2 

_ f 2/n for n > r 2 
\ 2 /r 2 for n < r 2 . 

After using this expression in equation (1), we have to break the integral 
over 7*2 into two parts, and have 

D 0 = A J cl 3 Xl | 1 ? 0 (xi )| 2 jjT dr 2 7 ^ e ~ 2r2/aZ + l dr 2 r 2 e- 2r2 / az | 

= — / d 3 xi|^? 0 (xi)| 2 — | 2 -e - pl (2 + pi)}, 

az J Pi 

where pi = 2r\/az- The integral over xj is relatively straightforward: 
given the normalisation of the spherical harmonics, we simply have to 
integrate over rq. We transform to the scaled radius p\ and find 


■J dpi pie Pl (2 — e Pl (2 + pi)} = 


10.2.2 Application of the variational principle to helium 

We can use the variational principle (§9.2) to refine our estimate of helium’s 
ground-state energy. Our estimate is based on the assumption that the elec¬ 
trons’ wavefunctions are those that would be appropriate if the electrons did 
not repel one another. Suppose we could somehow switch off this repulsion 
without affecting the attraction between each electron and the alpha particle. 
Then the electrons would settle into the wavefunctions we have assumed. If 
we then turned the electric repulsion back on, it would push the electrons 
apart to some extent, and the atom would become bigger. This thought 
experiment suggests that we might be able to obtain a better approximation 
to the electrons’ wavefunctions by increasing the characteristic lengtliscale 
that appears in the exponential of a hydrogenic ground-state wavefunction 
(eq. 8.36) from ag to some value a. The variational principle assures us that 
the minimum value with respect to a of the expectation value of H that we 
obtain with these wavefunctions will be a better estimate of the ground-state 
energy than the estimate we obtained by first-order perturbation theory. 

Consider, therefore, the expectation value of the Hamiltonian (10.24a) 
for the case in which the electronic wavefunction is a product of hydrogenic 
wavefunctions with ao replaced by a. From the work of the last subsection we 
already know the value taken when a = ao- Moreover, the expectation value 
is made up of five terms, two kinetic energies, and three potential energies, 
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and by dimensional analysis it is clear how each term must scale with a: the 
kinetic energies scale as a~ 2 because p is proportional to the gradient of the 
wavefunction, which scales like a -1 , while the potential energy contributions 
scale as a” 1 since they explicitly have distances in their denominators. We 
know that when a = a o and the wavefunctions are hydrogenic, the expec¬ 
tation value of the sum of hydrogenic Hamiltonians in equation (10.24a) is 
—87?, and we know from the virial theorem (eq. 2.93) that this overall energy 
is made up of 87? kinetic energy and —167?. of potential energy. We saw above 
that when a = do the electrostatic repulsion of the electrons contributes §7? 
of potential energy. Bearing in mind the way that these kinetic and potential 
energies scale with a it follows that for general a, the expectation value of 
helium’s Hamiltonian is 

{H) a = lZ{8x 2 — (16 — |)x} where x = —. (10.30) 

The derivative of ( H) a with respect to x vanishes when x = ||. When we 
insert this value of x into equation (10.30) we find our improved estimate of 
helium’s ground-state energy is — A(3/2) 6 7? = 77.4eV. As was inevitable, 
this value is larger than the experimentally measured value of —79.0 eV. But 
it is significantly closer than the value we obtained by first-order perturbation 
theory. 

An important indicator of the chemical nature of an element is the 
magnitude of the energy required to strip an electron from an atom, which 
is called the element’s ionisation energy. In the case of hydrogen, this 
energy is simply the binding energy, 13.6 eV. In the case of helium it is the 
difference between the binding energies of the atom and the ion He + that 
remains after one electron is stripped away. Since the He + ion is hydrogenic 
with Z = 2, its binding energy is 47? = 54.4 eV, so the ionisation energy 
of helium is 79.0 — 54.4 = 24.6 eV. This proves to be the largest ionisation 
energy of any atom, which makes helium perhaps the least chemically active 
element there is. 

10.2.3 Excited states of helium 

We now consider the low-lying excited states of helium. Given our success 
in calculating the ground-state energy of helium with the aid of hydrogenic 
wavefunctions, it is natural to think about the excited states using the same 
hydrogenic language. Thus we suppose that the electronic wavefunction is 
made up of products of single-particle wavefunctions. We recognise that the 
single-particle wavefunctions that should be used in these products will differ 
slightly from hydrogenic ones, but we assume that they are similar to the 
hydrogenic ones that carry the same orbital angular momentum and have 
the same number, n— 1, of radial nodes. Hence we can enumerate the single¬ 
particle wavefunctions by assigning the usual quantum numbers n and l to 
each electron. We expect to be able to obtain reasonable estimates of the 
energies of excited states by taking the expectation value of the Hamiltonian 
for hydrogenic states. 

In the first excited state it is clear that one of the electrons will have been 
promoted from its n = 1 ground state to one of the n = 2 states. From our 
discussion of shielding in §8.1.3, we expect that the state with l = 0 will have 
less energy than any other n = 2 state. Thus we seek to construct the first 
excited state from a product of the hydrogenic ground-state wavefunction 
4/i(x) and the wavefunction 4^(x) for n = 2, l = 0, which is also spherically 
symmetric. Since we are working with distinct single-particle states, we can 
construct both singlet and triplet states as described in §10.1.3. The spatial 
part of the wavefunction ('0™ of ipo) will be constructed from the product 
4 /14/2 symmetrised as described by equation (10.22). Since the two possible 
ways of symmetrising the product differ only in a sign, we defer choosing 
between them and make our formulae valid for either case, putting the sign 
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for the singlet state on top. We now have to calculate 

( H ) = ?f d 3 xdV {^(x)^(x') ± ^(x')^(x)} 

x H {4'i(x)4' 2 (x') ± 4'i(x')4' 2 (x)} . 


(10.31) 


When we substitute for H from equation (10.24a), integrals over terms such 
as 4 ');(x)4'2(x , )17h v I , i(x , )'I , 2(x) arise, where H H is the hydrogenic operator 
that appears in equations (10.24). The orthogonality of and 4Q causes 
these integrals to vanish, because H h contains either x or x', but not both 
operators so there is always an integral of the form 0 = / d 3 x T t(x)T 2 (x). 
The integral over 4'J(x)4'2(x , )i7H4'i(x)4' 2 (x') evaluates to either —A1Z or 
—TZ depending on whether Hu contains x or x'. Hence 

e 2 

(H) = -5K+- - {D±E}, (10.32a) 

47Te 0 

where D and E are, respectively, the direct and exchange integrals: 


D = 

E = 


d 3 xd 3 x' 


d 3 xd 3 x' 


I^' 1 ( x )4' 2 (x / )| 2 

|x-x'| 

Tt(x)T 2 (x)^(x , )4'i(x') 


(10.32b) 


Since both and *F 2 are spherically symmetric functions of their arguments, 
both integrals can be evaluated by the technique described in Box 10.1. After 
a good deal of tedious algebra one discovers that ( H) = —(56.6 =F 1.2) eV, 
where the upper sign is for the singlet state. The experimentally measured 
values are —(58.8 T 0.4) eV. Hence perturbation theory correctly predicts 
that the triplet state lies below the singlet state. 

The differences between our perturbative values and the experimental 
results arise because the hydrogenic wavefunctions we have employed are not 
well suited to helium. The deficiency is particularly marked in the case of 
the n = 2 wavefunction because the nuclear charge is significantly shielded 
from the outer electron, so the n = 2 wavefunction should extend to larger 
radii than the hydrogenic wavefunction we have employed, which assumes 
that the electron sees the full nuclear charge. Consequently, we have over¬ 
estimated the overlap between the two wavefunctions: the extent to which 
the wavefunctions permit the electrons to visit the same place. Because our 
wavefunctions have unrealistically large overlap, they yield values for both D 
and E that are too large. The exchange integral is particularly sensitive to 
overestimation of the overlap because it vanishes when there is no overlap, 
which D does not. Thus it is entirely understandable that our treatment 
yields binding energies that are insufficiently negative, and a singlet-triplet 
splitting that is too large. 

The sensitivity of the singlet-triplet splitting to wavefunction overlap 
leaves a clear imprint on the energy-level structure of helium that is shown 
in Figure 10.1: the separations of corresponding full (singlet) and dotted 
(triplet) lines diminishes as one goes up any column (increasing n) or from 
left to right (increasing l). Quantitatively, the singlet-triplet splitting when 
the excited electron is in a n = 2, l = 1 state (bottom of second column), 
rather than the n = 2, l = 0 state that we have just investigated (bottom of 
the first column), is 0.2eV rather than 0.8eV because, as we saw in §8.1.2, 
the l = 1 state has smaller amplitudes at the small radii at which the n = 1 
state has large amplitudes. 

We have discussed the splitting between singlet and triplet states in the 
case in which both the single-particle wavefunctions employed are spherically 
symmetric, so the wavefunctions are entirely real. The analysis for wavefunc¬ 
tions that have l ^ 0 is significantly more involved, but the essential result 
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Figure 10.1 Excited states of helium for n < 6 and l < 3. Energies are given with respect 
to the ground-state energy, and the line at the top shows the ionisation energy. Full lines 
show singlet states and dotted lines show triplet states. Fine structure splits the triplet 
states with l > 1 but the splittings are much too small to show on this scale - the largest 
is 0.00012 eV. 


is the same because the exchange integral E is always real (Problem 10.3) 
and positive. We can see that E is positive as follows. The exchange integral 
is dominated by the region x ~ x' in which the denominator is small. In 
this region the numerator does not differ much from | \I /1 (x) 2 (x) | 2 , so it is 
positive. Hence E is positive. Thus it is quite generally true that the triplet 
states lie below the corresponding singlet state. 

In our discussion of spin-orbit coupling in §8.2.1 we saw that the energy 
scale of that coupling is ~ jZ 2 a 4 m e c 2 (eq. 8.76b). For helium this evaluates 
to ~ 0.006 eV, which is two orders of magnitude smaller than the singlet- 
triplet splitting. Moreover, we found that the coupling vanishes for states 
with Z = 0, so it should vanish in the first excited state of helium. The 
singlet-triplet splitting is large because it has an electrostatic origin, rather 
than being a mere relativistic effect: a triplet state has less energy because 
in it the electrons are anticorrelated (§10.1.1). 

It is commonly stated that on account of this anticorrelation the energy 
of electrostatic repulsion between the electrons is smaller in triplet than in 
singlet states. This is false: the inter-electron potential energy is larger in 
the triplet than the singlet state. 2 The reason the triplet has lower energy 
is because it places the electrons closer to the nucleus than the singlet does. 
Moving the electrons towards the nucleus and thus towards one another nat¬ 
urally increases the energy of electron-electron repulsion, but this increase 
is outweighed by the lowering of the negative electron-nucleus energy. The 
quantitative results we obtained above should not be used to evaluate the 
inter-electron energy because they are based on hydrogenic wavefunctions, 
which provide a poor approximation to the true wavefunction. As we saw 
in §9.2, even a poor approximation to the wavefunction of a stationary state 
yields a useful approximation to the energy of that state because the expec¬ 
tation value of H is stationary in the neighbourhood of a stationary state. 
But the expectation value of a single term in H , such as the inter-electron 
potential energy, is not extremised by a stationary state, so the error in it 
will be of order the error in the wavefunction. In particular, to obtain a value 
that is accurate to first order in the perturbation, it is mandatory to use a 
wavefunction that is correct to first order, whereas we used the zeroth-order 
wavefunctions. Because the electrons do a better job of keeping out of each 
other’s way in the triplet state, in that state they can cohabit in a smaller 
volume, where the attraction of the nucleus is stronger. On account of this 

2 B. Schiff, H. Lifson, C.L. Pekeris & P. Rabinowitz, Phys. Rev.. 140, A1104, (1965) 
find the inter-electron energy to be 6.80 eV in the singlet state and 7.29 eV in the triplet 
state. 
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Box 10.2: Spectroscopic Notation 


Standard spectroscopic notation presumes that l and s, the total orbital 
and spin angular momenta, are good quantum numbers. The electronic 
configuration is a specification of the principal (n) and orbital angu¬ 
lar momentum (/) quantum numbers of the individual electrons of the 
outermost shell. Within a configuration a spectroscopic term speci¬ 
fies definite values for the total orbital l and spin s angular momenta of 
the outer electrons. Within each term a fine-structure level specifies 
a definite value for the total electronic angular momentum j. Within 
a fine-structure level may be distinguished different hyperfine levels 
that differ in total angular momentum /. The letters S, P , D , F denote 
l = 0,1,2,3, respectively. 


A typical configuration is denoted 2s2p 3 meaning one electron has n = 2, 
1 = 0 , and three electrons have n = 2, l = 1. 

Terms are denoted by ( 2s+1 ^, ) -; for example A P \/2 means s = l = 1, 



effect, the true singlet and triplet wavefunctions differ by more than just 
a change of sign; in equation (10.31) the functions Ti and T 2 should also 
change between the singlet and triplet cases. 

The singlet-triplet splitting in helium reflects destructive interference 
between the amplitudes for the two electrons to be simultaneously at the 
same place, and it is very much a quantum-mechanical effect. Through¬ 
out the periodic table, this mechanism gives rise to large energy differences 
between atomic states that differ only in their spin. These differences make 
ferromagnetism possible, and thus provide us with the dynamos, power trans¬ 
formers and electric motors that keep our civilisation on the move. 

10.2.4 Electronic configurations and spectroscopic terms 

The ground state of helium has neither spin nor orbital angular momentum. 
In conventional spectroscopic notation (Box 10.2) it is designated Is 2 , which 
implies that it has two electrons in the n = 1 S state. A related notation is 
used to indicate the spin, orbital and total angular momentum of the entire 
atom. In this system the ground state is designated 1 So- The superscript 
1 implies that the state is a spin-singlet because there is zero spin angular 
momentum. The S implies that there is no orbital angular momentum, and 
the subscript 0 implies that there is zero total angular momentum. 

The lowest dotted line in Figure 10.1 represents a triplet of excited 
states. These have the electronic configuration ls2s because there is an 
electron in the n = 1, l = 0 state and one in the n = 2, s = 0 state. 
They form the spectroscopic term 3 Si because the angular momenta of 
the whole atom are given by s = 1, l = 0 and j = 1. 

Just above this triplet of states comes the singlet state that has the same 
electronic configuration ls2s but which forms the distinct spectroscopic term 

Next come four spectroscopic terms that both have the electronic con¬ 
figuration ls2p: the most energetic of these terms is the singlet 1 P\, which is 
a set of three quantum states that have exactly the same energy but differ¬ 
ent orientations of the one unit of total angular momentum. Below this are 
three terms that have very similar energies: 3 Pq , 3 P\ and 3 P 2 . These terms 
differ from one another in the degrees of alignment of the spin and orbital 
angular momenta. In the 3 Po term the angular momenta are anti-parallel, 
with the result that the atom has zero angular momentum overall, while in 
the 3 P 2 term the angular momenta are parallel, so the atom has two units of 
angular momentum. There is just one quantum state in the 3 Pq term, and 
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five quantum states in the 3 P 2 term. The small energy differences between 
the 3 Pj terms are due to spin-orbit coupling. 

Spectrum of helium The selection rules listed in Table 9.1 include As = 
0, so in Figure 10.1 transitions between full and dotted levels are forbidden. 
Hence, an atom which is excited into one of the upper triplet states will 
cascade down through triplet states until it reaches the 3 Si level at the 
bottom of the triplet hierarchy. The states in this level are metastable 
because they can decay radiatively only by making the forbidden transition 
to the 3 So ground state, which takes appreciable time. Table 9.1 includes 
the rule Al = ±1, so transitions are only allowed between states that lie in 
adjacent columns, and the excited singlet state that is designated 1 Sq is also 
metastable. 


10.3 The periodic table 

The understanding of atomic structure that we have gained in our studies of 
hydrogen and helium suffices to explain the structure of the periodic table 
of the elements. 

10.3.1 From lithium to argon 

Imagine that we have a helium atom in its first excited state and that we 
simultaneously add a proton to the nucleus and an electron to the vacancy 
with principal quantum number n = 1 that arose when the atom was put 
into its excited state. After making these changes we would have a lithium 
atom in its ground state. The effects on the outermost electron of adding the 
positively charged proton and the negatively charged electron might be ex¬ 
pected to largely cancel, so we would expect the ionisation energy of lithium 
to be similar to that of a helium atom that’s in its first excited state. This 
expectation is borne out by experimental measurements: the ionisation en¬ 
ergy of once excited helium is 4.77 eV while that of lithium in its ground 
state is 5.39 eV. Thus the energy required to strip an electron from lithium 
is smaller than that required to take an electron from hydrogen or helium 
by factors of 2.5 and 4.6, respectively. The comparative ease with which 
an electron can be removed from a lithium atom makes compounds such as 
LiH stable (Problem 10.5). It also makes lithium is a metal by making it 
energetically advantageous for each atom in a lithium crystal to contribute 
one electron to a common pool of delocalised electrons. 

In their ground states atoms of hydrogen and helium cannot absorb 
radiation at optical frequencies because the first excited states of these atoms 
lie rather far above the ground state (10.2 and 19.8eV, respectively). The 
first excited state of lithium is obtained by promoting the n = 2 electron 
from l = 0 to l = 1. This change in quantum numbers only increases the 
electron’s energy by virtue of shielding (§8.1.3), so the energy difference is a 
mere 1.85 eV, the quantity of energy carried by photons of wavelength 671 nrn 
that lie towards the red end of the optical spectrum. Elements that lie beyond 
helium in the periodic table, the so-called heavy elements, feature very 
prominently in astronomical measurements even though they are present in 
trace amounts compared to hydrogen and helium because their absorption 
spectra contain lines at easily observed optical wavelengths. 

There is a useful parallel between a lithium atom and a hydrogen atom 
in its first excited state: the lithium nucleus, shielded by the two n = 1 
electrons, appears to have the same net charge as the proton in hydrogen, 
so the n = 2 electron moves in a similar electric field to that experienced by 
an electron with n = 2 in hydrogen. We can test this parallel quantitatively 
by comparing the ionisation energy of lithium (5.39 eV) with the energy of 
H with n = 2 (3.40eV). This agreement is not terribly good because the 
n = 2, l = 0 wavefunction that forms the ground state of lithium overlaps 
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Figure 9.2 The first five rows of the periodic table. 
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significantly with the n = 1 wavefunction, and therefore has exposure to 
the full nuclear charge. There is a more satisfying parallel between the first 
excited state of lithium, in which the n = 2 electron has l = 1 and the 
corresponding state of hydrogen: in this state lithium has ionisation energy 
3.54 eV. 

Consider now the effect of transmuting lithium into beryllium by simul¬ 
taneously increasing the nuclear charge by one unit and adding a second 
electron to the n = 2, l = 0 state. The parallel that we have just described 
suggests that this operation will be analogous to moving up from hydrogen 
to helium, and will significantly increase the ionisation energy of the atom. 
Experiment bears out this expectation, for the ionisation energy of beryl¬ 
lium is 9.32 eV, 1.7 times that of lithium. As in helium, the ground state 
of beryllium has total spin zero, while the first excited states have spin one. 
However, whereas in the excited states of helium the two electrons have dif¬ 
ferent values of n, in beryllium they both have n = 2, and they differ in their 
values of l. Consequently, the overlap between the single-electron states that 
form the beryllium triplet is significantly larger than the corresponding over¬ 
lap in helium. This fact makes the exchange integral in equations (10.32) 
large and causes the singlet excited state to lie 2.5 eV above the triplet of 
excited states. 

If we add a unit of charge to the nucleus of a beryllium atom, we create 
an atom of singly ionised boron. The four electrons with l = 0 that envelop 
the ion’s nucleus screen the nuclear charge to a considerable extent from the 
perspective of the lowest-energy unfilled single-particle state, which is a 2 p 
state (n = 2,1 = 1). The screening is far from complete, however, so the 
nuclear charge Z that the outermost electron perceives is greater than unity 
and the dynamics of the outermost electron of boron is similar to that of the 
electron in a liydrogenic atom with Z > 1. The ionisation energy from the 
n = 2 level of hydrogen is \Z 2 1Z = 3.40 Z 2 eV, while that of boron is 8.30 eV, 
so Z ~ 1.6. 

Spin-orbit coupling causes the ground state of boron to form the 2 P \/2 
term in which the electron’s spin and orbital angular momenta are antipar¬ 
allel. At this early stage in the periodic table, spin-orbit coupling is weak, 
so the excited states of the 2 P 3 / 2 term lie only 0.0019 eV above the ground 
state. C + ions have the same electronic configuration as boron atoms, and 
in interstellar space are more abundant by factors of several thousand. Even 
at the low temperatures (~ 20 K) that are characteristic of dense interstel¬ 
lar clouds, collisions carry enough energy to lift C + ions into the low-lying 
excited states of the 2 P 3 / 2 term, so such collisions are often inelastic, in 
contrast to collisions involving the very much more abundant hydrogen and 
helium atoms and hydrogen molecules. At the low densities prevalent in in¬ 
terstellar space, an excited C + ion usually has time to return to the ground 
state by emitting a photon before it is involved in another collision. So C + 
ions cool the interstellar gas by radiating away its kinetic energy. As a re¬ 
sult of this physics, the temperature of interstellar gas depends sensitively 
on the abundances in it of the commonest heavy elements, carbon, nitrogen 
and oxygen. The propensity of interstellar gas to collapse gravitationally 
into stars depends on the temperature of the gas, so the formation of stars 
depends crucially on the existence of low-lying excited states in boron and 
the next few elements in the periodic table. 

When we add another unit of charge to the nucleus of a boron atom, 
the binding energy of the outermost electron increases by a factor of order 
(6/5) 2 = 1.44. Adding a further electron, which can go into another 2 p state 
alongside the existing outer electron, offsets this increase in binding energy 
to some extent, so we expect the ionisation energy of carbon to lie some¬ 
where between the 8.30 eV of boron and 1.44 times this value, 12.0 eV. The 
experimental value is 11.3 eV, which lies at the upper end of our anticipated 
range, implying that the mutual repulsion of the two 2 p electrons is not very 
important energetically. This is to be expected because the ground state of 
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carbon belongs to the triplet term 3 3 Pq, so the electrons keep out of each 
other’s way. As in boron, the first excited states lie very close to the ground 
state - they form a 3 Pi term 0.0020 eV above the ground state, and there is 
a 3 P 2 term 0.0033 eV above that. 

Adding a unit of charge to the nucleus of carbon and then dropping 
an electron into another 2 p single-particle state creates a nitrogen atom. 
The ionisation energy increases (to 14.5 eV) for exactly the same reason that 
it did when we transmuted boron into carbon. The spin of all three 2 p 
electrons are aligned to ensure that the wavefunction is antisymmetric in the 
electrons’ spatial coordinates. Hence the ground state of nitrogen belongs 
to a quadruplet of states. The total orbital angular momentum proves to be 
zero (Problem 10.7), so all states in this quadruplet have the same energy, 
and there are actually four distinct ground states that together comprise the 
term 4 S 3 /2 - it is sometimes rather confusingly said that the ground ‘state’ 
of nitrogen is four-fold degenerate. The lowest excited states form the 2 D 3 / 2 
term, and they lie 2.3835 eV and 2.3846 eV above the ground state. 

Since there are only three single-particle spatial wavefunctions available 
with 1 = 1 , namely the wavefunctions for m = ±1,0, when we add an elec¬ 
tron to nitrogen to form oxygen, the overall wavefunction cannot be totally 
antisymmetric in the spatial coordinates of the electrons. The result is that 
in oxygen the electrons are less effective in keeping out of each other’s way 
than are the electrons in carbon and nitrogen, and the ionisation energy of 
oxygen (13.6 eV) is slightly smaller than that of nitrogen. The ground states 
form the term 3 P 2 , while states in the 3 P± and 3 Po terms lie 0.020eV and 
0.028 eV above the ground state. In these terms three of the electrons have 
cancelling orbital angular momenta as in nitrogen, so the orbital angular 
momentum of the atom is just the single unit introduced by the fourth elec¬ 
tron: hence the P in the ground-state term. Spin-orbit interaction causes 
the ground state to have the largest available value of j, whereas in carbon 
the reverse was the case. 

The easiest way to understand fluorine, the element that follows oxygen 
in the periodic table, is to skip ahead two places from oxygen to neon, in 
which a full house of six electrons is packed into the 2 p states. Every spin is 
paired and every value of the orbital angular momentum quantum number in 
is used twice, so both the spin and the orbital angular momenta sum to zero. 
Each of the six 2 p electrons is exposed to a large fraction of the ten units 
of charge on the nucleus, so the ionisation energy is large, 21.6 eV, second 
only to helium of all the elements. There are no low-lying excited states. 
This fact together with the large ionisation potential makes neon chemically 
inactive. 

We transmute neon into fluorine by taking away a unit of nuclear charge 
and one of the 2 p electrons. The ‘hole’ we have left in the otherwise complete 
shell of 2 p electrons behaves like a spin-half particle that carries one unit of 
orbital angular momentum. Hence the ground state of fluorine has s = i 
and l = 1. Spin-orbit interaction causes the 2 P 3 / 2 term to lie 0.050 eV below 
the 2 P \/2 term that also arises when spin-half is combined with one unit of 
orbital angular momentum. In the case of oxygen we encountered a similar 
maximisation of j, and it turns out that the ground states of atoms with 
shells that are more than half full generally maximise j, while j is minimised 
in the ground state of an atom with a shell that is less than half full. 

We have now reached the end of the first long period of the table. The 
second long period, from sodium to argon, is an almost perfect copy of the 
period we have just covered. Figure 10.3 illustrates this statement by show¬ 
ing the ionisation energies of the elements in the first three periods. There 
is an abrupt drop in the ionisation energy as we move from neon to sodium, 
from an inert noble gas to a highly reactive alkali metal. Then the ionisation 

3 See Problem 10.6 for an explanation of why the ground state of carbon has l = 1 
rather than l = 0 or l = 2, which are the other possible results of combining two electrons, 
each of which has 1 = 1. 
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Figure 10.3 Ionisation energies of the first nineteen elements. 
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Figure 10.4 The lowest-lying energy 
levels of carbon silicon and germa¬ 
nium. Along this sequence the fine- 
structure splitting between the three 
lowest-lying levels increases dramat¬ 
ically. In the case of germanium the 
spread in the energies of the triplet 
states is no longer negligible com¬ 
pared to the energy gap between the 
lowest-lying singlet state and the top 
triplet state. 


energy creeps up as one moves along the period, with two small setbacks, be¬ 
tween magnesium and aluminium, and between phosphorus and sulfur, that 
are associated with the start and the half-way point of the 3 p, respectively. 

10.3.2 The fourth and fifth periods 

After reaching argon with its full 3 p shell, one might expect to start filling 
the 3d shell. Actually the 4s states prove to be lower-lying because their 
vanishing angular momentum allows much greater penetration of the cloud 
of negative charge associated with the electrons that have n < 3. But once 
the 4s shell has been filled, filling of the 3d shell commences with scandium 
and continues unbroken through zinc. Once the 3d shell is full, filling of the 
4 p shell commences, finishing with the noble gas krypton at Z = 39. 

In the next period, filling of the 5s shell takes precedence over filling of 
the 4 d shell, and when, after cadmium at Z = 48, the 4 d shell is full, filling 
the 5 p shell takes precedence over filling the 4/ shell. The last two periods 
are very long and have complex patterns due to the availability of shells with 
large l that tend to be filled much later than shells with the same n but small 
l. 

ft is instructive to compare the pattern of energy levels as we move down 
one column of the periodic table. Figure 10.4 shows the lowest energy levels 
of carbon and the elements, silicon and germanium, that lie beneath it in 
the periodic table. As we saw above, carbon has at the bottom of its energy- 
level diagram a cluster of three very closely spaced energy levels that form the 
terms 3 Pj for j = 0, 1, 2. As we proceed through silicon and germanium the 
spacing within this cluster grows markedly because it is caused by spin-orbit 
coupling, which scales like Z 4 (§8.2.1). By the time we reach silicon the 
energy differences created by spin-orbit coupling are no longer very small 
compared to the energy difference between the triplet and singlet states, 
which we know is of electrostatic origin. The total electron spin operator 
S 2 = (y~). S, ) 2 does not commute with the term in the Hamiltonian which is 
generated by spin-orbit interaction, which is proportional to JA S,; • L,. So 
long as this term is small compared to the terms in the Hamiltonian that do 
commute with S 2 , total electron spin is a good quantum number and it is 
meaningful to describe the atom with a spectroscopic term such as 3 Pq. In 
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reality the atom’s wavefunction will include contributions from states that 
have l ^ 1, but the coefficients of these terms will be very small, and for most 
purposes they can be neglected. As the contribution to the Hamiltonian from 
spin-orbit coupling grows, the coefficients in the ground-state wavefunction 
of terms with l ^ 1 grows, and the designation 3 Po becomes misleading. The 
lowest-lying three levels of germanium can for most purposes be treated as 
3 Pj terms. In the case of tin, which lies under germanium, the designation 
3 Pj is highly questionable, and in lead, which lies under tin, it is valueless. 

Problems 

10.1 Show that when the state of a pair of photons is expanded as 

W) = n)\n'), (10.33) 

nn' 

where (|n)} is a complete set of single-photon states, the expansion coeffi¬ 
cients satisfy b nn i = b n ' n . 

10.2 By substituting from equation (10.22) for tp 0 into equation (10.20), 
express the singlet state of an electron pair 0,0) as a linear combination of 
products of the single-particle states |it, ±) and |v, ±) in which the individual 
electrons are in the states associated with spatial amplitudes u(x) and v(x) 
with S z returning ±-(. Show that your expression is consistent with the Pauli 
condition a nn r = —a n ' n . 

Given the four single-particle states | u, ±) and |u, ±), how many linearly 
independent entangled states of a pair of particles can be constructed if 
the particles are not identical? How many linearly independent states are 
possible if the particles are identical fermions? Why are only four of these 
states accounted for by the states in first excited level of helium? 

10.3 Show that the exchange integral defined by equation (10.32b) is real 
for any single-particle wavefunctions Ti and T 2 . 

10.4 The H - ion consists of two electrons bound to a proton. Estimate 
its ground-state energy by adapting the calculation of helium’s ground-state 
energy that uses the variational principle. Show that the analogue for H _ of 
equation (10.30) is 

( H) = lZ(2x 2 — ^4-x ) where x = —. (10.34) 

4 a 

Hence find that the binding energy of H is ~ 0.9457?.. Will H~ be a stable 
ion? 

10.5* Assume that a LiH molecule comprises a Li + ion electrostatically 
bound to an H ion, and that in the molecule’s ground state the kinetic 
energies of the ions can be neglected. Let the centres of the two ions be 
separated by a distance b and calculate the resulting electrostatic binding 
energy under the assumption that they attract like point charges. Given that 
the ionisation energy of Li is 0.407? and using the result of Problem 10.4, 
show that the molecule has less energy than that of well separated hydrogen 
and lithium atoms for b < 4.4ao. Does this calculation suggest that LiH is a 
stable molecule? Is it safe to neglect the kinetic energies of the ions within 
the molecule? 

10.6* Two spin-one gyros are a box. Express that states | j,m) in which 
the box has definite angular momentum as linear combinations of the states 
11 , m) | l,mf) in which the individual gyros have definite angular momentum. 
Hence show that 

|o,o) = ^(|i,-i)|i,i>Hi,o>|i,o) + |i,i)|i,-i>) 

By considering the symmetries of your expressions, explain why the ground 
state of carbon has l = 1 rather than l = 2 or 0. What is the total spin 
angular momentum of a C atom? 
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10 . 7 * Suppose we have three spin-one gyros in a box. Express the state 
10, 0) of the box in which it has no angular momentum as a linear combination 
of the states |1, m)|l, m , )|l, m") in which the individual gyros have well- 
defined angular momenta. Hint: start with just two gyros in the box, giving 
states | j, to) of the box, and argue that only for a single value of j will it be 
possible to get |0,0) by adding the third gyro; use results from Problem 10.6. 

Explain the relevance of your result to the fact that the ground state of 
nitrogen has 1 = 0. Deduce the value of the total electron spin of an N atom. 

10 . 8 * Consider a system made of three spin-half particles with individual 
spin states |±). Write down a linear combination of states such as |+)|+)|—) 
(with two spins up and one down) that is symmetric under any exchange of 
spin eigenvalues ±. Write down three other totally symmetric states and say 
what total spin your states correspond to. 

Show that it is not possible to construct a linear combination of products 
of |±) which is totally antisymmetric. 

What consequences do these results have for the structure of atoms such 
as nitrogen that have three valence electrons? 



11 

Adiabatic principle 


We often need to understand the quantum mechanics of systems that have 
a large number of degrees of freedom. We might, for example, be interested 
in the speed at which sound waves propagate through a macroscopic crystal 
of diamond. This depends on the deformability of the bonds between the 
crystal’s carbon atoms, which is in turn determined by the orbits of the 
atoms’ electrons. Also relevant is the inertia of a block of crystal, which 
is mostly contributed by the carbon nuclei. These nuclei are dynamical 
systems, in which protons and neutrons move at mildly relativistic speed. 
Each proton or neutron is itself a dynamical systems in which three quarks 
and some gluons race about relativistically. When a sound wave passes 
through the crystal, each nucleus experiences accelerations that must affect 
its internal dynamics, and the dynamics of its constituent quarks. Is there 
any chance that a sound wave will induce a nucleus to transition to an excited 
state? Could a sound wave cause an atom to become electronically excited? 

So long as such transitions are realistic possibilities, it is going to be 
extremely difficult to calculate the speed of sound, because the calculation 
is going to involve atomic physics, nuclear physics and quantum chromo- 
dynamics - the theory strong interactions and quarks, which governs the 
internal structure of protons and neutrons. The adiabatic approximation, 
which is the subject of this chapter, enables us to infer that such transitions 
are exceedingly unlikely to occur. Consequently, in this case and a vast num¬ 
ber of similar situations, the adiabatic approximation greatly simplifies our 
problem by permitting us to neglect phenomena, such as electron or nuclear 
excitation, that have energy scales that are significantly larger than the char¬ 
acteristic energy scale of the phenomenon under investigation, even though 
the different degrees of freedom are dynamically coupled. Moreover, we shall 
see that the adiabatic approximation enables us to calculate quantities such 
as the spring constant of the bonds that bind a crystals’s atoms from the 
dynamics of the electrons that form these bonds. It also provides the theo¬ 
retical underpinning for the kinetic theory of gases, for most of condensed- 
matter physics and much of chemistry. It is enormously important for the 
development of quantum field theory and our understanding of quantum 
chrornodynamics. Hence, the adiabatic approximation is an extraordinarily 
important tool with applications that span the natural sciences. 

We start by deriving the adiabatic approximation. Then we study in 
turn elementary applications of it to kinetic theory, to thermodynamics, to 
condensed-matter physics, and to chemistry. 
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11.1 Derivation of the adiabatic principle 

In §2.2 we stressed that the tdse (2.26) is valid even when H is time- 
dependent. However, we have mostly confined ourselves to Hamiltonians 
that are time-independent. In §9.3 we did consider a time-dependent Hamil¬ 
tonian, but we assumed that the time-dependent part of H was small. Now 
we consider the case in which H can change by an amount that is large, so 
long as the time T over which this change takes place is long in a sense that 
will be specified below. 

We consider the dynamics of a system that has a slowly varying Hamil¬ 
tonian H{t). At any instant, H(t) has a complete set of eigenkets \E n {t)) 
and eigenvalues E n (t). For the case of vanishing time dependence, equa¬ 
tion (9.43) provides an appropriate trial solution of the tdse (2.26). After 
modifying this solution to allow for time-variation of the E n , we have 


ItM) = y^M£)exp ^ J dtf E n (lf)j\E n (t)). 


( 11 . 1 ) 


Since for each t the set {| E n [t))} is complete and the numbers a n (t) are 
arbitrary, no assumption is involved in writing down this expansion of the 
system’s ket. When we substitute the expansion (11.1) into the tdse, we 
find 

m^=H\^) = Y,a n ^V^\f^t'E n {t')^H{t)\E n {t)) 


— ^ ) ( {4 FlCln 4“ ttn E n (f)} | E n (f)) -(- \TlClr 

n ' 


d\En) 

dt 


( 11 . 2 ) 


x exp — 


d t' E n (t’] 


Exploiting the fact that \E n (t)) is an eigenket of H(t ) we can cancel a term 
from each side and are left with 

0 = Y (q»I E n (t)) + a n ^Q^ ) exp ^ ^ J d t'E n (t')j. (11.3) 

Now we use the perturbation theory developed in §9.1 to expand \E n (t + 
6t)) as a linear combination of the complete set (|£' ra (t))}. That is, we write 

| E n (t + 8t)) - | E n (t)) = Y, bnm\E m (t)), (11.4) 

m^n 


where from (9.9) we have 


(E m (t)\SH\E n (t)) 
En(t) - E m (t) 


(11.5) 


with SH the change in H between t and t + St. Dividing equation (11.4) by 
St and substituting the result into (11.3) we find 


0 = YA a n \E n (t))+a , 


E 

m^n 


(E m {t)\H\E n (t)) , 
E„(t) - E m {t) 


\E m ) \ exp - 


d t'E n (t')y 

( 11 . 6 ) 
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Figure 11.1 A plot of sin (kx) times the slowly varying function 1/(1 -\-x 2 ). As k —»■ oo 
and the wavelength of the oscillations becomes shorter, the negative contribution from 
each section of the curve below the x axis more nearly cancels the positive contribution 
from the preceding upward section. 


When we multiply through by (E k (t) | this yields 


Ofc 


= -£< 
riy^k 


i(t) 


(E k (t)\H \E n {t)) 
E n (t) - 


^f exp (~U 


(H-7) 

Although we have used first-order perturbation theory, our working so far 
has been exact because we can make 5H as small as we please by taking 
St to be small. Now we introduce an approximation by supposing that H 
is a slowly varying function of time in the sense that it changes by very 
little in the time Ti/ min(|i? n — E k |), which is the time required for significant 
motion to occur as a result of interference between the stationary states with 
energies E n and E k (§3.2). In this approximation, the right side of equation 
(11.7) is a product of a slowly varying function of time and an approximately 
sinusoidal term that oscillates much more rapidly. When we integrate this 
expression to get the change in a k , the integral vanishes rather precisely 
because the contributions from adjacent half-periods of the oscillating factor 
nearly cancel (Figure 11.1). Hence, if initially a k = 1 for some k, it will 
remain unity throughout the evolution. This completes the derivation of the 
adiabatic approximation: if a system is initially in the k th state of well- 
defined energy, it will stay in this state when the Hamiltonian is changed 
sufficiently slowly. 


11.2 Application to kinetic theory 

Consider air that is being compressed in the cylinder of a bicycle pump. 
The air resists the compression by exerting pressure on the cylinder and 
its piston, and it grows hot as we drive the piston in. This phenomenon 
is usually explained by treating the air molecules as classical particles that 
bounce elastically off the cylinder walls. In this section we use the adiabatic 
principle to interpret the phenomenon at a quantum-mechanical level. 

We proceed by first imagining that there is only one molecule in the 
cylinder, and then making the assumption that when there are a large num¬ 
ber N of molecules present, the pressure is simply N times the pressure we 
calculate for the single-particle case. The Hamiltonian that governs our basic 
system, a particle in a box, is 

H(t) = ^- + V(x,t), (11.8) 


where the potential V (x, t) is provided by the walls of the box. The simplest 
model is 


V(x,t) 


0 for x in the cylinder 
oo for x in a wall or the piston. 


(11.9) 


The time dependence of V arises because the piston is moving. We need 
to find the eigenvalues E n and eigenkets \E n ) of the Hamiltonian (11.8). 
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We work in the position representation, in which the eigenvalue equation 
becomes 

h 2 

-—V 2 u n + Vu n =E n u n (11.10) 

2m. 

with u n (x) = (x| E n ). From §5.1.1(a) we have that u n should vanish on 
the walls of the cylinder and the piston. For x inside the cylinder, the 
second term on the left of equation (11.10) vanishes, so E n and u n (x) are 
the solutions to 

h 2 

— —— X 2 u n = E n u n with u n = 0 on the boundary. (11.11) 


We assume that the cylinder’s cross section is rectangular or circular, so 
coordinates exist such that (i) the cylinder’s walls are all surfaces on which 
one coordinate vanishes and (ii) the Laplacian operator separates. That is, 
we can write 



( 11 . 12 ) 


where V) is an operator that depends on the two coordinates, x and y, 
that specify location perpendicular to the cylinder’s axis, and z is distance 
down that axis. In this case, we can find a complete set of solutions to 
equation (11.11) for eigenfunctions that are products u n (x) = X(x,y)Z(z) 
of a function X of x and y , and a function of z alone. Substituting this 
expression for u n into equation (11.11) and rearranging, we find 


zx\x 


2mE n 


-XZ = -X 


,d 2 Z 

dz 2 ' 


(11.13) 


When we divide through by XZ , we find that the left side does not depend 
on z while the right side does not depend on x or y. It follows that neither 
side depends on any of the coordinates. That is, both sides are equal to 
some constant, which we may call 2 m£ z /h 2 . This observation enables us to 
separate our original wave equation into two equations 


V 2 X = 2 m(E £ z ) x 

h 2 

d 2 Z 2 m£ z v 


(11.14) 


The physical content of these equations is clear: £ z is the kinetic energy 
associated with motion along the cylinder’s axis, so motion perpendicular to 
the axis carries the remaining energy, E n — £ z . As we push in the piston, 
neither the equation governing X and E n — £ z nor its boundary conditions 
change, so E n —£ z is invariant. What does change is the boundary condition 
subject to which the equation for Z has to be solved. 

We place one end of the cylinder at z = 0 and the piston at z = L. 
Then it is easy to see that the required solution for Z is [cf. §5.1.1(a)] 


Z(z) oc sm(knz/L) with A; = 1,2,..., (11.15) 


and the possible values of £ z are 

- 5 &X- <”• 16 > 

The adiabatic principle assures us that if we let the piston out slowly, 
the particle’s value of the quantum number k will not change, and its energy 
£ z will evolve according to equation (11.16). By conservation of energy, the 
energy lost by the particle when L is increased by d L must equal the work 
that the particle does on the piston, which is PdV, where P is the pressure 
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it exerts and dV is the increase in the cylinder’s volume. Let A be the area 
of the piston. Then conservation of energy requires that 

—d£ z = 2£ Z ^L = PAdL } (11.17) 

Li 

from which it follows that 

2 £ z £ z 

P=—A = 2—. 11.18 

AL V y ’ 

When we sum the contributions to the pressure that arise from a large 
number, N, of molecules in the cylinder, equation (11.18) yields 

P = 2^(£ z ), (11.19) 

where the angle brackets mean the average over all molecules. At this point 
we have to take into account collisions between the N molecules. Colliding 
molecules change the directions of their momenta and thus transfer energy 
between motion in the z direction and motion in the plane perpendicular to 
it. Collisions do not satisfy the adiabatic approximation, so they do change 
the quantum numbers of particles. Their overall effect is to ensure that the 
velocity distribution remains isotropic even though the piston’s motion is 
changing £ z and not the energy of motion in the plane of the piston, E n — £ z . 
So we may assume that ( E) = 3 (£ z ). Let U = N (E) be the internal energy 
of the gas. Then eliminating (£ z ) from equation (11.19) in favour of U , we 
obtain 

PV = | U. (11.20) 

This result is identical with what we obtain by combining the equation of 
state of an ideal gas, PV = Nk^T, with the expression for the internal 
energy of such a gas, U = ^NksT. Actually our result is more general 
than the result for an ideal gas because we have not assumed that the gas 
is in thermal equilibrium: the only assumption we have made about the 
distribution of kinetic energy among particles is that it is isotropic. 

11.3 Application to thermodynamics 

In §6.4 we saw that when a system is in thermodynamic equilibrium, we 
do not know what quantum state it is in but can assign a probability pi oc 
e -Ei/k B T ^Fat p, j s j n its i th stationary state (eq. 6.93a). The energy Ei 
of this state depends on the variables, such as volume, electric field, shear 
stress, etc., that quantify the system’s environment. In the simplest non¬ 
trivial case, that in which the system is a fluid, the only relevant variable is 
the volume V and we shall consider only this case. Hence we consider the 
energy of each stationary state to be a function A,(V). 

In an adiabatic compression of our system, we slowly change V while 
isolating the system from heat sources. From the adiabatic principle it follows 
that during such a compression the system remains in whatever stationary 
state it was in when the compression started. Consequently, the probabilities 
Pi of its being in the various stationary states are constant, and the entropy 
S = —kn E,; Pi hip,; (eq. 6.91) is constant during an adiabatic change, just 
as classical thermodynamics teaches. 

During an adiabatic compression, the change in the internal energy U = 

Ei Pi E i is 

d U = 'Y^Pi^r = where P = - ^ Pi ( n - 21 ) 

i i 

Since there is no heat flow, the increment in U must equal the work done, 
which is the pressure that the system exerts times —dV, so the quantity P 
defined by equation (11.21) is indeed the pressure. 
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11.4 The compressibility of condensed matter 


As a second application of the adiabatic principle, we estimate the compress¬ 
ibility of solids and liquids. In condensed matter atoms touch one another 
and the volume of the bulk material can be reduced only if every atom is 
made smaller. If an atom’s electrons are to be confined to a smaller volume, 
by the uncertainty principle, their momenta and therefore their kinetic en¬ 
ergies must increase. We estimate the compressibility of matter by equating 
the work done in compressing it to the resulting increase in the energy of 
the atom. The adiabatic approximation tells us that during slow compres¬ 
sion, the atom remains in its ground state. Hence the compressibility can be 
deduced if we can calculate the ground-state energy E 0 as a function of the 
atom’s volume V. 

Compressibility \ is defined to be the fractional change in volume per 
unit applied pressure P: 


1 dV 
VdP' 


( 11 . 22 ) 


Conservation of energy implies that — P dV, the work done by the compressor, 
is equal to the increase in the ground-state energy dA 0 , so P = — dA 0 /dV 
and 


X = 



(11.23) 


Eq(V) can be obtained by solving for the atom’s stationary states with the 
electronic wavefunction required to vanish on the surface of a sphere of vol¬ 
ume V. A highly simplified version of such a calculation enables us to obtain 
a rough estimate of the compressibility of condensed matter. 

We assume that when the atom is confined in a sphere of radius a, 
its wavefunction (x| a) is the same as the wavefunction for the confining 
sphere of radius a o with all distances rescaled by a/ao and the appropriate 
adjustment in the normalisation. In this case, we can argue as in §9.2 that 
the expectation value (a\K\a) of the atom’s kinetic energy operator K scales 
as (ao/a) , while the expectation value of the potential-energy operator V 
scales as ao/a. Hence 


d E 0 
da 


da 


((a|AT|a) + (a|Vja)) ~ -2 


(a|A» 


(a|T|a) 

a 


(11.24) 


Equation (8.52) states that 2(a\K\a) = —(a|V|a), so the right side of this 
equation vanishes. 1 Differentiating again, we find 


d 2 A 0 _ (a|A» 9 (o|U|a) 

da 2 a 2 a 2 


- 2 - 


.Et 


(11.25) 


where equation (8.52) has been used again to simplify the right side. Since 
V oc a 3 , dV/da = 3V/a and bearing in mind our result that d£b/da = 0 we 
find 


d 2 Ao ^ / a \ 2 d 2 Ao _ 2 Eq 

dV 2 ~ V3V/ da 2 “ V 2 ' 


(11.26) 


Using this result in equation (11.23), we conclude that the compressibility is 


X - 


9 _^ 

2 |£o|' 


(11.27) 


Some care is required in the application of this result to many-electron 
atoms. Our assumption that (a|V|a) scales as a -1 is valid only if the wave- 
function is simultaneously rescaled in the coordinates of all the atom’s elec¬ 
trons. Unfortunately, it is physically obvious that, at least for small fractional 

1 Equation (8.52) was actually only derived for hydrogen, but the result applies to the 
gross structure of any atom. 
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changes in volume, only the outermost shell of electrons will be significantly 
affected by the confining sphere. So realistically we should assume that the 
system formed by the inner electron shells remains fixed and the wavefunction 
is rescaled in its dependence on the coordinates of electrons in the outermost 
shell. In this spirit we shall replace |£p| by N times the atom’s ionisation 
energy, where N is the number of electrons in the outermost shell. Since the 
electrostatic potential produced by the nucleus and the fixed inner shells of 
electrons does not vary with radius as r' 1 , (a|V|a) will not scale as a -1 and 
the factor | in equation (11.27) will be in error. None the less, the equation 
should correctly predict the order of magnitude of an atom’s compressibility. 

For lithium we take V = |7r(2ao) 3 and |-E 0 | = 5.39 eV to find \ = 
2.6 x 10” 11 Pa -1 . The measured value varies with temperature and is of order 
10~ 10 Pa -1 , which is in excellent agreement with our quantum-mechanical 
estimate given the sensitivity of the latter to the adopted value of the rather 
ill-defined parameter V. 


11.5 Covalent bonding 

The air we breathe, the living tissue of our bodies, and the plastics in the 
clothes, chairs and carpets that surround us, are held together by covalent 
bonds. These are bonds between two atoms of similar electronegativity, such 
as two oxygen atoms, two carbon atoms or a carbon atom and an oxygen 
atom. In this section we explain how they arise through the sharing by the 
atoms of one or more electrons. Unlike the ionic bonds that hold together a 
crystal of common salt, which are crudely electrostatic in nature, a covalent 
bond is intrinsically quantum-mechanical. 


11.5.1 A model of a covalent bond 

To show how covalent bonding works, we study a one-dinrensional model 
that is not at all realistic but it is analytically tractable. 2 We imagine a 
particle of mass m that moves along the x axis in a potential V ( x ) that is 
made up of two 6- function potentials of the type we introduced in §5.1.1(b). 
The wells are separated by distance 2a: 

V{x) = — Vs{5(x + a) + S(x — a)}. (11.28) 

We have placed the origin at the midpoint between the two wells, which we 
can do without loss of generality. This placement ensures that the Hamil¬ 
tonian commutes with the parity operator, and we can seek solutions of the 
TISE that have well-defined parity. There are three distinct regions in which 
V(x) = 0, namely x < — a , —a < x < a and x > a and in these regions the 
wavefunction u(x) must be a linear combination of the exponentials e^ , 
where k is related to E by 


k = V-2mE /h. (11.29) 

With an eye to the construction of solutions of definite parity we let our 
solutions in these regions be 

( e kx for x < —a, 

u(x) oc < cosh (kx) or sinh(fc:r) for —a < x < a, (11.30) 

[ e~ kx for x > a, 


2 Physicists call a model that lacks realism but nonetheless captures the physical 
essence of a phenomenon, a toy model. 
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Bearing in mind that the wavefunction has to be continuous across the po¬ 
tential wells, we see from Figure 11.2 that solutions of each parity must be 
of the form 


u+(x) = A x 


gfc( x+a) 

cosh {kx)/ cosh(fca) 

e -k(x-a) 


u_(x) = B x 


_gk(x+a) 

sinh(fcx) / sinh(/ca) 

e -k(x-a) 


for x < —a, 
for —a < x < a, 
for x > a, 

for x < —a, 
for —a < x < a, 
for x > a, 


(11.31) 


where the constants on each segment of the real line have been chosen to 
ensure that the wavefunctions equal A and B, respectively, at x = a. 

On account of the symmetry of the problem, it suffices to choose k for 
each parity such that the equation (5.13) is satisfied at x = a. From this 
equation we have that 


( k{ 1 + tanh(fca)} for even parity 
( k{l + coth(fca)} for odd parity 


(11.32) 


where K is defined by equation (5.14). By expressing the hyperbolic func¬ 
tions in terms of e ka , we can rearrange the equations into 

4 - 1 = ±e~ 2ka , (11.33) 

K 

where the upper sign is for even parity. In the upper panel of Figure 11.3 
the left and right sides of these equations are plotted; the solution k is the 
ordinate at which the straight line of the left side intersects with the decaying 
exponential plot of the right side. The value of k+ that we obtain from 
the upper curve associated with the even-parity case is always larger than 
the value obtained for the ocld-parity case. By equation (11.29), the 
particle’s binding energy increases with fc, so the even-parity state is the 
more tightly bound. If we increase a, the exponential curves in the top panel 
of Figure 11.3 become more curved and approach the x-axis more rapidly. 
Hence k+ diminishes, and fc_ grows. In the limit a — > oo, the exponentials 
hug the axes ever more tightly and and k- converge on the point k = K 
at which the sloping line crosses the ai-axis. This value of k is precisely that 
for an isolated well as we would expect, since in the limit a —> oo the wells 
are isolated from one another. The lower panel of Figure 11.3 shows the 
energies associated with k± from equation (11.29). 

Suppose our particle is initially in the ground state of two wells that 
are distance 2a apart, and imagine slowly moving the two wells towards one 
another. By the adiabatic principle, the particle stays in the ground state, 
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l/(Ka) 


Figure 11.3 Graphical solution of equation (11.33). In the top panel, the exponential is 
drawn for the case Ka = 1. Bottom panel: binding energy versus inverse separation. The 
scale energy Eq = —Ti 2 K 2 /2m is the energy of the bound state of an isolated 5-function 
potential well. 

which, as we have seen, moves to lower energies. Hence the particle loses 
energy. Where does this energy go? It can only go into the mechanism that 
positions the wells. A little thought reveals that if work is done on this 
mechanism, it must be resisting the mutual attraction by the holes. Hence 
we have arrived at the conclusion that two potential wells that are jointly 
binding a particle, can experience a mutual attraction that would not be 
present if the bound particle were absent. 


11.5.2 Molecular dynamics 

The toy model just presented describes an inherently quantum-mechanical 
mechanism by which atoms experience mutual attraction or repulsion through 
sharing a bound particle. An essentially identical calculation, in which the 
energy E e of the two shared electrons on an H 2 molecule is studied as a 
function of the separation b of the protons, enables one to understand the 
structure of the H 2 molecule. Analogously with the toy model, the energy 
of the shared electrons decreases monotonically with 5, so in the absence of 
the mutual electrostatic repulsion of the protons, which had no analogue in 
our model, the electrons would draw the protons ever closer together. In 
reality there is a particular value &o of b at which the rate at which E e (b) 
decreases equals the rate at which the electrostatic energy E p (b) of the pro¬ 
tons increases with decreasing b. The classical intranuclear separation of an 
H 2 molecule is 60 . 
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A more complete theory is obtained by considering V ( b ) = E e + E p to 
be a dynamical potential in which the two nuclei move. The analysis of this 
problem proceeds in close analogy with our treatment of the hydrogen atom 
in §8.1: we introduce a reduced particle and observe that the Hamiltonian 
that governs its dynamics commutes with the reduced particle’s angular- 
momentum operator; this observation enables us to derive a radial wave 
equation for each value of the angular-momentum quantum number l. This 
radial wave equation describes oscillations that are governed by the effective 
potential V(b). The rotation-vibration spectrum of H 2 may be understood 
as arising from transitions between states that are characterised by l and the 
quantum number n associated with the oscillations in b. 

Similar principles clearly apply to studies of the dynamics of many other 
molecules: one starts by determining the energy of shared electrons for a 
grid of fixed locations of the nuclei. The resulting energies together with 
the energy of electrostatic repulsion between the nuclei yields an effective 
potential, that can then be used to study the quantum dynamics of the 
nuclei. The essential approximation upon which this kind of work depends 
is that the frequencies at which the nuclei oscillate are low compared to any 
difference between energy levels of the electronic system, divided by Ti. Since 
electrons are so much lighter than nuclei, this approximation is generally an 
excellent one when the molecular rotations and vibrations are not strongly 
excited. The approximation is guaranteed to break down during dissociation 
of a molecule, however. We return to the toy model to explain why this is 
so. 


11.5.3 Dissociation of molecules 

In the model of §11.5.1, the force provided by the particle that is shared 
between the potential wells is not always attractive: if the particle in the 
excited, odd-parity bound-state the energy of the particle increases as the 
separation of the wells 2 a is diminished, so the positioning mechanism must 
be pushing the two wells together as it resists the mutual repulsion of the 
wells. Consider now a two-well molecule that is held together by the attrac¬ 
tive force provided by the shared particle in its ground state when a photon 
promotes the particle to its excited state. Then the force provided by the 
particle becomes repulsive, and the wells will begin to move apart. As they 
move, much of the energy stored in the excitation of the particle is converted 
into kinetic energy of the wells, and soon there is one bare well and one well 
with a trapped particle. 

As the wells move apart, the energy difference between the ground and 
excited states decreases, while the rate of increase of a increases. Hence 
the adiabatic approximation, which requires that a/a <C (E 0 — E e )/h must 
break down. In a more complex system, such as a real CO molecule, this 
breakdown can cause some of the energy stored in the particle’s excitation 
being transferred to excitation of one or both of the final atoms rather than 
being converted to the kinetic energy of their motion apart. 


11.6 The WKBJ approximation 

In §5.4 we learnt from a numerical solution of the tise that when a particle 
encounters a modest ramp in its potential energy V , the amplitude for re¬ 
flection is small unless the distance over which V changes is small compared 
to the particle’s de Broglie wavelength. This result is closely related to the 
adiabatic approximation in the sense that in the particle’s rest frame, the 
potential that it experiences changes slowly compared to the time taken to 
cover a de Broglie wavelength. Now we given an important analytical ar¬ 
gument that leads to the same conclusion and allows us to determine the 
evolution of the wave as it moves over the barrier. 
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The equation we wish to solve is 

—-y = —k 2 il> where k 2 (x) = -^-(E — V), (11.34) 

da. h 

and V ( x ) is some real function. We define 

r x \/2rn r x , _ 

(f>(x) = / d x'k(x') = —-— / da;' s/E — V{x '), (11.35) 

so k = d(f>/dx. Then without loss of generality we write 

4>{x) = S(x)e l<l> , (11.36) 


where S' is a function to be determined. When we substitute this expression 
for into the tise (11.34), cancel the right side with one of the terms on 
the left and then divide through by e 1<?i , we obtain 


d 2 S dS . d k n 

——— + 2i k— —I- iS—— — 0. 

dx z da’ da; 


(11.37) 


Now we reason that when k is a slowly varying function of x, S will be 
also. In fact, if k changes on a lengthscale L 1/k that is greater than 
the wavelength 2n/k, we will have |dfc/da:| ~ \k\/L, |dS/da;| ~ |S |/L and 
|d 2 S/dx 2 | ~ |S|/L 2 . In these circumstances it follows that the first term 
in equation (11.37) is negligible compared to the other two, and we may 
approximate the equation by 


d In S i d In k 

da; 2 da; 

Integrating both sides from a;i to X 2 we find that 


(SVk)\ xi = (Sy/k) 


(11.38) 


(11.39) 


The particle flux implied by ij>(x) is proportional to the probability density 
|S| 2 times the particle speed k/m. Hence equation (11.39) states that the 
particle flux at x^ is equal to that at X 2 - In other words, when the wavenum¬ 
ber changes very little in a wavelength, the reflected amplitude is negligible 
and the wavefunction is approximately 


i/>{x) ~ constant x 


2 m{E - V) 



(11.40) 


where <p(x) is given by equation (11. 35,). This solution is known as the 
WKBJ approximation. 3 The WKBJ approximation guarantees conserva¬ 
tion of particle flux in the classical limit of very small de Broglie wavelengths. 
It also has innumerable applications outside quantum mechanics, including 
the working of ear trumpets, tidal bores and Saturn’s rings. 


3 The WKBJ approximation is named after Wentzel, Kramers, Brillouin and Jeffreys. 
This is frequently abbreviated to ‘WKB approximation’. 
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Problems 

11.1 In §9.1 we obtained estimates of the amount by which the energy 
of an atom changes when an electric or magnetic field is applied. Discuss 
whether the derivation of these results implicitly assumed the validity of the 
adiabatic principle. 

11.2 In §11.2 we assumed that the potential energy of air molecules is 
infinitely large inside a bicycle pump’s walls. This cannot be strictly true. 
Give a reasoned order-of-magnitude estimate for the potential in the walls, 
and consider how valid it is to approximate this by infinity. 

11.3 Explain why E/ui is an adiabatic invariant of a simple harmonic os¬ 
cillator, where w is the oscillator’s angular frequency. Einstein proved this 
result in classical physics when he was developing the “old quantum theory”, 
which involved quantising adiabatic invariants such as E/co and angular mo¬ 
mentum. Derive the result for a classical oscillator by adapting the derivation 
of the WKBJ approximation to the oscillator’s equation of motion x = —cu 2 x. 

11.4 Consider a particle that is trapped in a one-dinrensional potential 
well V(x). If the particle is in a sufficiently highly excited state of this 
well, its typical de Broglie wavelength may be sufficiently smaller than the 
characteristic lengtlrscale of the well for the WKBJ approximation to be valid. 
Explain why it is plausible that in this case 


1 

h 



da/ \/2m{E — V(x')} 


= 717T, 


(11.41) 


where the E —V(xi) = 0 and n is an integer. Relate this condition to the 
quantisation rule j> dxp x = nh used in the “old quantum theory”. 

11.5 Show that the “old quantum theory” (Problem 11.4) predicts that 
the energy levels of the harmonic oscillator are nhoj rather than (n + ^ )Hoj. 
Comment on the dependence on n of the fractional error in E n . 

11.6 Suppose the charge carried by a proton gradually decayed from its 
current value, e, being at a general time fe. Write down an expression for 
the binding energy of a hydrogen atom in terms of /. As a —> 0 the binding 
energy vanishes. Explain physically where the energy required to free the 
electron has come from. 

When the spring constant of an oscillator is adiabatically weakened by 
a factor / 4 , the oscillator’s energy reduces by a factor / 2 . Where has the 
energy gone? 

In Problems 3.14 and 3.15 we considered an oscillator in its ground 
state when the spring constant was suddenly weakened by a factor / = 1/16. 
We found that the energy decreased from ^hu; to 0.2656S.U; not to Tiui/ 512. 
Explain physically the difference between the sudden and adiabatic cases. 

11.7 Photons are trapped inside a cavity that has perfectly reflecting walls 
which slowly recede, increasing the cavity’s volume V. Give a physical mo¬ 
tivation for the assumption that each photon’s frequency v oc V' 1 / 3 . Using 
this assumption, show that the energy density of photons u oc V -4 / 3 and 
hence determine the scaling with V of the pressure exerted by the photons 
on the container’s walls. 

Black-body radiation comprises an infinite set of thermally excited har¬ 
monic oscillators - each normal mode of a large cavity corresponds to a new 
oscillator. Initially the cavity is filled with black-body radiation of tem¬ 
perature To- Show that as the cavity expands, the radiation continues to 
be black-body radiation although its temperature falls as V” 1 / 3 . Hint: use 
equation (6.121). 
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11.8 Show that when a charged particle gyrates with energy E in a uni¬ 
form magnetic field of flux density B , the magnetic moment p = E/B is 
invariant when B is changed slowly. Hint: recall Problem magmomentprob. 
By applying the principle that energy must be conserved when the magnetic 
field is slowly ramped up, deduce whether a plasma of free electrons forms a 
para- or dia-nragnetic medium. 



12 

Scattering Theory 


In this chapter we study situations in which a free particle approaches a 
region of enhanced potential, is deflected, and moves away in a new direction. 
Different potentials lead to different probabilities for a particle to be scattered 
in a particular direction, so by carefully measuring the outcomes of repeated 
scattering experiments, we can infer the potential that was responsible. 

In fact, most of what we know about the small-scale structure of matter 
has been learnt this way. For example, Rutherford discovered that atoms 
have dense, compact nuclei by studying the distribution of a-particles scat¬ 
tered by gold foil, while nowadays we study the sub-atomic structure of mat¬ 
ter by scattering extremely fast-nroving electrons or protons off one another 
in high-energy accelerators. 

The task of scattering theory is to build a bridge between the Hamiltoni¬ 
ans that govern the evolution of states and the quantities - cross sections and 
branching ratios - that are actually measured in the laboratory. In §5.3 we 
investigated the scattering of particles that are constrained to move in one 
dimension, and found that quantum mechanics predicts qualitatively new 
scattering phenomena. We expect the freedom to move in three dimensions 
rather than one to be fundamental for the physics of scattering, so in this 
chapter we investigate three-dimensional scattering. We shall find that the 
new phenomena we encountered in §5.3 do carry over to physically realistic 
situations. 


12.1 The scattering operator 

Let | ip) be the state of a particle in a scattering experiment. The evolution 
of | i/j) is governed by the tdse 

= H\ip) <^> |tM) = U{t)\ip;0), (12.1a) 

where U(t) = j g tj me evolution operator introduced in §4.3. We 

break the Hamiltonian into a sum H = Hk + V of the kinetic-energy operator 
H k = p 2 /2m and the potential V that causes scattering. If \tp) represents a 
moving particle - one that approaches or leaves some collision - it must be a 
non-trivial superposition of the eigenstates of H - see §2.3.3. Unfortunately, 
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when V ^ 0 we may not know what these eigenstates are, and it may be 
prohibitively difficult to find them. 

A crucial physical insight allows us to make progress: in a scattering 
experiment, long before the particle reaches the interaction region it is ap¬ 
proximately free. The evolution of a free particle \<p) is governed by H k 


[h di\^ = Hk ^ ° r = 


(12.1b) 


where U^{t) = e iHkt / n , so the statement that \ip) behaves like a free particle 
in the asymptotic past is the requirement that 


lim U(t) |'i/’;0) = lim UK{t)\(f>; 0) 

£—>•—oo t —>—oo 


(12.2a) 


for some free state | </>). Implicit in this equation is the assumption that the 
origin of time is chosen such that the interaction takes place at some finite 
time. For example, it might be in full swing at time t = 0, which we shall 
sometimes refer to as ‘the present’. 

In the asymptotic future the scattered particle will have moved far away 
from the interaction region and will again be approximately free. Hence we 
also require 

lim U(t)\ip;0) = lim Uk (t )\<j>' ; 0) (12.2b) 

t —^ -|-oo t —^ ~I - OO 

where |<//) is another free state. 

Equations (12.2) allow us to relate the real state \ip) to both \cf>) and 
\(f>') at the present time via 

IV’lO) = lim tft( t )tf K (i)|0;O> = lim U\t')U^)\<t>'-,Q) 

t—> — oo t'— H-oo /in on 


= ^+l < />; o) = o), 

where the operators are defined by 1 


= lim [/i(t)17 K (i). 

£—OO 


The origin of the irritating choices of sign in this definition will be explained 
in §12.2 below. In terms of the f 1± operators, the scattering operator S 
is defined as 

5 = (12.5) 

Here’s what the scattering operator does: first, S evolves a free-particle state 
back to the distant past, then matches it onto a real state which has the same 
past asymptotic behaviour. Next, S evolves this real state forwards - all the 
way through the scattering process to the far future. There, the real particle 
again behaves like a free particle, and S matches their states before finally 
evolving the free state back to the present. If the real particle is in some 
state |^), and looks like a free-particle state \(f)) well before the interaction 
occurs, then the amplitude for it will look like some other free state |A) long 
after the interaction is (A|<S|</>). Hence the probability to find the particle in 
the free state |A) is just |(A|<S|^)| 2 . The scattering operator is useful because 
it always acts on free states, so if we use it we do not need to know the 
eigenstates of the full Hamiltonian H. 

Notice that S is defined as a product of four unitary evolution operators 
and is therefore itself unitary. 

When V = 0, the particle isn’t scattered, and its future state is the 
same as its past state. In such circumstances, the scattering operator must 
be just the identity operator 5=1, and we can check this is indeed true by 

1 It is not self-evident that the limits as t —¥ ±oo that appear in equation (12.4) exist. 
Appendix K derives a condition on V that ensures that is well defined. 
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putting H = Hk in equations (12.3) to (12.5). The operator that describes 


a genuine interaction is the transition operator 

T = S-1 (12.6) 

and the probabilities for actual transitions are given by 

Prob(|</>) |A)) = \(X\m\ 2 = |(A|5|0) - <A|0>| 2 . (12.7) 

Since S is unitary, we have 

i = 5 f 5 = i + r f + r+r i T. (12.8) 

Squeezing this equation between (</>| and | </>) we obtain 

-2Ke«0|T|0» = {<t>\r^r\(t>). (12.9) 

We also have that 

k^i^)i 2 = 11 + wmf = 1+2j?e((0i rm + mrm 2 - (12.10) 

Rearranging and using equation (12.9) yields 


i-Msm 2 = £ (<p\T'\i>i)(ipi\T\<f>) = J2 ( 12 . 11 ) 

where {| 4>i)} is a complete set of states that includes \<f>). The left side of 
equation (12.11) is one, minus the probability that at t = +oo the particle 
is still in the state it was in at t = — oo, while the right side is the sum of 
the probabilities that the particle has made the transition to some state \ipi) 
different from the original state | <j>). 


12.1.1 Perturbative treatment of the scattering operator 

The definition S = is difficult to use in practical calculations, because 

the true Hamiltonian H (whose eigenstates we do not know) is buried rather 
deep inside. To get at it we first differentiate f l(t) = U' {t)U\<i{t) with respect 
to t, finding 

-^ft(t) = i e im/n {H - H K )e~ iHKt/R = L e ^t/hy e -iH K t/h , (12.12) 

where we have been careful to preserve the order of the operators. We now 
re-integrate this equation between t' and t to reach 


n(t) = net') + i [ d T e iHT/R Ve- iHKT/h 
ft Jt> 

= n{t') + i j AtU\t)VU k (t). 
Taking the Hermitian adjoint of this equation we have 


o \t) = n\t') 

= fit(t') 


l -j\rUl{T)VU{r) 
i 


(12.13) 


(12.14) 
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The integrand itself contains fT(r), but suppose we use this equation to 
replace it by Q(t') plus an integral that involves Qf (V). and then repeat this 
process once more. The result is 

n\t) = n\t') - ± JdrU^{ t)VUk(t) 

- ^2 U k(t)VUk(t - T')VU K (T’)rt(t') 

+ -^ [dr [dr 1 [ cl t" U^(t)VU k (t - t')VU k {t' - t”)VU k {t")^(t"). 
h Jf jf Jt" 

(12.15) 

Through repeated use of equation (12.14) we can push the operator flt(r) 
for t > t' off into an integral that contains as many powers of V as we please. 
For sufficiently small V , the magnitude of the term in which f2(r) occurs will 
be negligible, and we will be able to drop it. Then multiplying the equation 
by ST(f'), and taking the limits t —> oo and t' —> —oo, we obtain an expansion 
of S in powers of V. Since fl' (t')Cl(t') = 1, this expansion is 


i f°° 

5=1-- J dr U^(t)VUk(t) 

1 /*»o rr 

- -j / dr / dr' U^(t)VU k (t 

iL J —OO J — OO 

• /»00 /»T pT 


- t')VU k (t') 


+ ^3 / dr / dr' / dr" U^(t)VU k (t - t’)VU k {t' 

h J — oo j —oo J —oo 


t")VU k (t") 


_|_ ... 

(12.16) 

The virtue of equation (12.16) is that all the evolution operators involve only 
the free Hamiltonian Hk information about scattering has been encoded 
in the expansion in powers of V. 2 

Equation (12.16) has an intuitive physical interpretation. The zeroth- 
order term is the identity operator and represents no scattering; its presence 
was anticipated by equation (12.6). The term S' 1 - 1 with one power of V acts 
on a free particle as 


• ,oo 

<A;0|S<%;0>=-- J dr(A;O|[4(r)Vtf K (r)|0;O> 
= J dr(A; r\V\(f>; r). 


(12.17) 


The integrand (A; r| V|0; r) is the amplitude for a particle in the free state | <f>) 
to be deflected by the potential V at time r, transferring it into another free 
state |A). Since we only observe the initial and final states, we do not know 
when the interaction took place, so we add the amplitudes for the deflection 
to have occurred at any time. Similarly, the second-order term S ^ gives the 
amplitude 


<A;0|5(%;0)= ^ J7rJ dr' (A; t\VU k (t — t 7 )^;t') (12.18) 

for an incoming particle in the free state | <j>) to be deflected by the potential 
at time rb then to propagate freely for a further time r — r', and finally 
to be deflected again by V into the final state |A). Since we do not know 

2 This expansion is reminiscent of the perturbation theory developed in §9.1. However, 
that theory hinged on the assumption that the response of the system to changes in its 
Hamiltonian is analytic in the parameter (3. Here we need no such assumption. Instead 
we guess that certain integrals become small. 
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when either deflection occurred, we integrate over all r and t' . subject to 
the condition t' < r that the first deflection happens earlier. Higher-order 
terms describe trajectories that involve larger numbers of deflections. 

If the potential is sufficiently weak, we might hope to approximate equa¬ 
tion (12.16) by its lowest-order terms 

• pOO 

S ~ 1 - i J dr uI(t)VU k (t). (12.19) 

This drastic curtailing of the series for S is known as the Born approxima¬ 
tion. Whether it is a good approximation in a given physical situation must 
be checked, often by estimating the order-of-magnitude of the second-order 
term 50 ), and checking it is acceptably smaller than the Born term. 


12.2 The S-matrix 

It is impossible to put any physical particle into a pure energy eigenstate, 
because such states are not localised in time. Nonetheless, energy eigenstates 
are useful as mathematical tools, being simpler to handle than realistic su¬ 
perpositions. Calculating with energy eigenstates presents special problems 
in scattering theory, because the idea that the particle moves towards the 
potential is central to our entire formalism, but a particle that is in an energy 
eigenstate goes nowhere. 


12.2.1 The ie prescription 

To get to the root of the problem, notice that equation (12.4) implies that 


HQ± = i h — 
d t 


3 -i Ht/h 


Q± 


= i?i 
= ih 
= ihfl± 


d_ 
d t 
d_ 
d t 


t =o 

t=o 

t=0 

cl 
d t 


lim e- iHt/n UUT)U K (T) 

T —>=POO y 

lim U\t — t)U\^{r — t)U^{t) ) 

T—>=P00 J 

e -iH K t/h = Q ±Hk 


( 12 . 20 ) 


t=0 


Therefore, if the interacting state \ip) initially resembles some eigenstate 
|S; free) of the free Hamiltonian, then equations (12.2a) and (12.20) imply 
that 


H\^) = HQ+\E; free) = Q + H K \E-, free) = EQ+\E-, free) = E |?/>), (12.21) 

so \ip) must actually be an eigenstate |.E;true) of the true Hamiltonian, with 
the same energy E. The trouble with this is that energy eigenstates look the 
same at all times, so IE 1 ; true) and | E] free) would always look like each other 
- a state of affairs that only makes sense if V = 0 and there is no scattering. 

Our argument shows that the initial and final states cannot have well- 
defined energy - they must be non-trivial superpositions of energy eigen¬ 
states. Nonetheless, because energy eigenstates often simplify otherwise dif¬ 
ficult calculations, we are reluctant to forego them. Instead, we seek a way 
to avoid the problem. From equation (12.13), write fl± in the form 

= 1 + - / dr U\t)VUk (t) 
ft Jo 
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When 12+ is applied to a real scattering state \<j>), the integrand vanishes for 
large r, because ( x.\Uk\4> ) is non-negligible only far from the scattering centre, 
where (x| V|x)' ~ 0. Hence, if we include a convergence factor e~d T \/ h in the 
integrand, for sufficiently small e > 0 we make a negligible difference to the 
action of 12+ on a real scattering state: for finite r this factor approximates 
unity to arbitrary accuracy as e —> 0. Consider therefore the operator 12 that 
has this harmless factor. Taking the limit that the constant e approaches 
zero from above we obtain 


Ot|0> —= fl+ lim 4 r dTU^T)Ve- eM/h U K {T)] |0>. 

V e^0+ hj o J 

(12.23) 

The action of 12+ on a state \<f>) that is a non-trivial superposition of en¬ 
ergy eigenstates is identical to that of 12+. However, Problem 12.1 shows that 
the product HQ± satisfies an equation that differs crucially from equation 
( 12 . 21 ): 

HT2+= 12±(Hk± ie)=Fie. (12.24) 

Consequently, when we apply H to | ip) = |£ 7 ; free), where |.E;free) is an 

eigenstate of Hr, we find 


H\ip) = HCl±\E; free) = (E ± ie)\ip) =F ie|£7; free), (12.25) 

so | ip) is an eigenstate of H only when V = 0 and | if)) = \E; free). Therefore, 
when we use 12+ to generate ‘interacting’ states from eigenstates of Hk, 
which will henceforth be simply labelled \E), our interacting states are not 
stationary states of the true Hamiltonian, and thus can describe scattering. 
The crucial point that makes the whole procedure consistent is that for any 
physically realistic superposition, it makes no difference whether we construct 
interacting states with 12+ or 1 1±. 

We can simplify Cl± a little: since |r| = r for r > 0 and |r| = —t for 
t < 0, with | (j>) = | E) equation (12.23) becomes 

Q±\E) = ^1+ lim drt/ f (r)ye“ i(B±ie)r/ ^ \E). (12.26) 

Therefore, our modification merely supplements the energy eigenvalue E 
with a small imaginary piece +ie for initial states and — ie for final states 
- the sign on ie corresponds to the subscript on f l± and is historically the 
origin of the naming of the fl operators. This procedure is known as the 
ie prescription. In practice the prescription is implemented by using the 
original f2± operators, but pretending that all eigenstates of Hk satisfy 


Hk\E) = {E + ie)| E) for initial kets when acted on by 11+ 
Hk\E') = ( E' — ie)I-E7 7 ) for final kets when acted on by H_. 


(12.27a) 


Similarly, the Hermitian adjoints of the modified operators (12.23) imply 
that we should likewise pretend that 

(E\Hk = (E\(E — ie) for initial bras when acted on by fit 
X 1 X IV ’ ^ + (12.27b) 

(. E'\Hk = ( E'\(E ' + ie) for final bras when acted on by Sll. 

In no way do we mean that the Hermitian operator Hk actually has a com¬ 
plex eigenvalues E; equations (12.27) are merely useful fictions that enable 
us to carry out the ie prescription. 
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12.2.2 Expanding the S-matrix 

Since the incoming and outgoing states are free states, and the momentum 
operators commute with the free Hamiltonian Hk, the scattering operator is 
conveniently studied in the momentum representation. We then work with 

the S-matrix 

«s(p,p') = (p'|«5|p). (12.28) 

where we must use the ie prescription of equations (12.27) to interpret the 
action of on |p) that is implicit in this definition. From equation (12.16), 
the lowest-order contribution to the S-matrix is then 


<P'|S|p> 


i 

(p'|p) - - / dr {p'\e iHKT/h Ve- iHKT/h \p) 
n J-oo 

• roo 

(p'lp)-f(p'l^lp) / Cl 
' l J-oo 


(12.29) 


Here, we used the rules (12.27) to find that the argument of the exponential 
in the integrand is actually independent of e. We recognise the integral as 
2nhS(E p — E p i). 

Potentials that depend only on position are diagonal in the x represen¬ 
tation, so the momentum-space elements (p'|y|p) are 

(p'|H|p) = [ d 3 xd 3 x' (p'|x')(x , |^|x)(x|p) 

(12.30) 


where we have used our expression (2.78) for the wavefunction of a state of 
well-defined momentum and defined the momentum transfer 


(2irh) 3 


d 3 xe _lqx 


/h V{y 


q = p' - p. (12.31) 

Therefore, the Born approximation to the S-matrix just depends on the 
Fourier transform of E(x): 

(p'\ S W\p) = -j^S(E p -E p ,) j d 3 xe -iq ' x / R V(x). (12.32) 

From the theory of Fourier transforms, we see that potentials which vary 
rapidly with x lead to S-matrices that contain significant amplitudes for large 
momentum transfers. Turning this around, if a particle suffers a large change 
in momentum when it is scattered by V (x), we infer that V (x) has sharp 
features. Arguing along these lines (albeit more classically), Rutherford was 
able to deduce the existence of nuclei from the occasional back-scattering 
of a-particles off gold foil. More recently, a team of physicists 3 working 
at SLAC in Stanford scattered high-energy electrons off protons; the elec¬ 
trons sometimes suffered large-angle scattering, providing evidence for the 
existence of quarks inside the nucleons. 

The second-order term in the scattering operator can be treated in a 
similar manner. From equation (12.16) we find 

(p'|5( 2) |p) = --2 / dr / c\t' ( P '\uI(t)VU k (t - t')VU k (t')\p). (12.33) 

The free-evolution operators can be evaluated by inserting the identity op¬ 
erator 1 = Jd 3 k|k)(k| anywhere between the two V operators. Bearing in 


3 D.H. Coward, et. al., Phys. Rev. Lett. 20, 292, (1968). 
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mind that Hk |p) = ( E p + ie) |p) and (p'|i?K = (p'\(E p > + ie) in accordance 
with the ie prescription, we find 


<p'|S< 2 >|p) = f lim -1 J d 3 k^(p'|F|k)(k|V|p) 


e*00 nT 

d r / e i(E pl +ie-E k )T/h e -i(E p -E k +ie)T'/H 


> —oo J — oo 


The integral over r' is 


dr' e -i(E p -E k+ ie)r’/n = m 


e -i(E p -E k +ie)T/h 

Ep — Ek + ie 


(12.34) 

(12.35) 


so the second-order contribution to the S-nratrix is 


5< 2 >(P'.P) 


lim u gp 

e->o+ n J E p — E/~ + ie 


dr e~'^ E p~ E p'^ T / n 


= —2'ki5(E p 


E V A lim 
p £-> 0 + 



(p'|F|k)(k|F|p) 
E p — Ek + ie 


(12.36) 

The numerator in the integrand is the amplitude for the particle to scatter 
from the |p) state into the |k) state, and then from the |k) state into the |p') 
state. The denominator arose from the integration over r', which in turn was 
present because the particle travelled freely for some time r — t' in between 
the two interactions. Since it comes from this free propagation, the factor 
(Ep — Ek + ie)^ 1 is known as the propagator, written here in the momentum 
representation. Finally, because we do not measure the intermediate state, 
equation (12.36) adds up the amplitudes for scattering via any state. 

Higher order terms are handled in a similar way: V occurs n times in 
«S (n) (p',P), so there are n — 1 intermediate evolution operators, leading to 
n — 1 propagators. Similarly, there are n—1 sets of intermediate states, all of 
which are integrated over, ^^(p', p) may be represented diagrammatically 
as in Figure 12.1. These Feynman diagrams are an order-by-order book¬ 
keeping system for calculating contributions to the S-nratrix: each term in 
the series for the S-matrix corresponds to a diagram, and Feynman rules 
can be defined that enable the algebraic expression for the term to be inferred 
from the diagram. Thus Feynman diagrams summarise complicated integrals 
in an intuitive way. 

The Feynman rules required here are extremely simple: (i) each vertex 
has just two lines going into it and is associated with a factor V ; (ii) each 
‘internal line’ (one that has a vertex at each end) is associated with the prop¬ 
agator (Ep-Ek- fie) -1 , where k, which is integrated over, is the momentum 
carried by that line; (iii) there is an overall prefactor —27ri S(E p — E p >), where 
p and p' are the ingoing and outgoing momenta, respectively. With these 
rules we can only construct one diagram with a given number n of vertices, 
and it’s a simple chain. Feynman diagrams become much more interesting 
and valuable when one recognises that when an electron is scattered by an 
electrostatic potential V (x), for example, it really collides with a photon, and 
one needs to include the coupled dynamics of the photons. In this more so¬ 
phisticated picture, V (x) is replaced by the electromagnetic vector potential 
A, which becomes a quantum-mechanical object, and our diagrams include 
propagators for both photons and electrons. Moreover, the vertices become 
points at which three or more lines meet, two for the incoming and outgoing 
electron, and one or more for photons. With a richer set of lines and vertices 
on hand, many different diagrams can be constructed that all have the same 
number of vertices, and therefore contribute to the S-matrix at the same 
order. 
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Figure 12.1 Feynman diagrams 
for the scattering process to lowest 
orders in V. 


12.2.3 The scattering amplitude 

Both the first- and second-order approximations to the S-matrix are pro¬ 
portional to an energy-conserving delta function. This result is not limited 
to the series expansion for S, but actually holds for the exact S-matrix as 
we now demonstrate. Equation (12.20) and its Hermitian adjoint state that 
Hfl± = Q±Hk and fl±H = Hk^±- Now S = so 

h k s = h k sitft + = nlHn + = nlfi+H K = sh k , (12.37) 

that is, [5,77 k] = 0. Sandwiching this commutation relation between mo¬ 
mentum eigenstates and using the ie prescription of equations (12.27) gives 
the relation 

0 = (p'|[5,i7 K ]|p) = {E p + ie - E p , - ie)S(p,p') = (E p - E p >)S{p,p'), 

(12.38) 

so the S-matrix vanishes unless the initial and final states have the same 
(real) energy. This tells us that the exact S-matrix must have the form 
5(p,p') oc 6{E P - E p ,). 

In equation (12.6) we broke S into the sum S = 1 + T to isolate the 
scattering amplitude, and it is clear that (p'|T|p) is also proportional to 
8{Ep — E p i). Motivated by this insight we define the scattering amplitude 
/(P -> P') by 


(p, m p > = — }~S(E P - E p ,)f(p -> p'), (12.39) 

where the factor of i/(27rftm) is included for later convenience. On account 
of the delta function, f{p—> p') depends on p' only through its direction p'. 

To understand the significance of the scattering amplitude, consider the 
following argument. According to the discussion in §12.1, long after the 
interaction, a particle that scattered from the free state \<f>) can be described 
by the free state |A) = S\cj)). Therefore, in the idealised case that the initial 
state was a momentum eigenstate |p), the wavefunction of the final state is 

(r|A) = (r|«S|p) = (r|p) + f d 3 p' (r\p')(p'\T\p) 

J (12.40) 

= + 2^hm J “ E p’)f(P P')- 

Since the states |p') in the integrand are final states, the ie prescription tells 
us to take p ,2 /2 m = ( E p > — ie), so in spherical polar coordinates 4 

cl 3 p' = p' 2 dp 'dfl = m\J2m(Ep' — ie) difp'dfl. (12.41) 

Using this in equation (12.40) and integrating over E p , using the delta func¬ 
tion gives 


(r|A) = (r|p)- 


1 yj2m{Ep ie) j iry /2m(E p - ie) 


(27 T ^,) 5 / 2 


p r 


/R f(p p ')- (12.42) 


4 In this chapter it is convenient to define = sin^d^d^) rather than d 2 r2 = 
sin 6 d 6 d (f) as in earlier chapters. 
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This free-particle wavefunction only looks like the true wavefunction of the 
scattered particle long after the collision, so equation (12.42) will only cor¬ 
respond to the physical wavefunction as r —> oo. In this limit, the phase of 
the exponential in equation (12.42) varies extremely rapidly as a function 
of the variables 9 and <j> that define the direction of p, over which we are 
integrating. The different contributions to the integral will therefore cancel 
each other out except where the phase of the integrand is stationary with 
respect to angle. For sufficiently large r the sensitivity of the exponential to 
angle will exceed that of /(p —> p'). Hence the dominant contribution to 
the integral arises when 

A a n 

— (p' • r) = — cos 9 = 0 and ^-cos6» = 0, (12.43) 

where we have aligned the polar axis with the (fixed) direction f . These 
conditions are satisfied when 9 = 0, tt, independent of <j>. When 9 = tt, and 
p' • r = —1, the integrand of equation (12.42) is exponentially suppressed as 
r — > oo by the ie prescription. Therefore the integral over the unit sphere is 
dominated by the contribution from a small disc centred on the direction r. 
This insight justifies the approximation 


JdCle ir y/ 2rn ^- ie) p'- i/n f (p p ') 

~ 27 r/(p -p p 7 ) J dcos 0 e iW2r n ( E P -ie)c° S 9/n 

~ 2-jfi 11 E P 2 _ f >iry/2m(E p -ie)/h 

vryj2m[E p — ie) 

Using this expression in equation (12.42) we have finally 


(12.44) 


/ l e iry/2m(E p -ie)/h N 

lim (r| (t>) = lirn (r|p) + — , - - - /( P -t pr) 


r—>oo r—>oo 


(27 rh) 3 / 2 r 


1 / pi pr/h 

= lim IO 3/2 ( eip r/h + -) ’ 

r->oo (27 t n) 6 / z \ r 


(12.45) 


where in the last line we have taken the limit e —► 0 + . Equation (12.45) shows 
that a particle that was initially in a momentum eigenstate will emerge from 
the scattering process in a superposition of its original state (no scattering) 
and a wave travelling radially outwards. The scattering amplitude /(p — > pr) 
is just the amplitude of this outgoing wave. 

In equation (12.45) the time-dependence is suppressed by our convention 
that the S-matrix generates the wavefunction at the generic time 0. We now 
restore explicit time dependence by introducing a factor e~ lE P t / n and replace 
the incoming state |p) by a realistic superposition of such states. Then the 
outgoing wavefunction becomes 


( r l</>; t) 


d 3 P^(P) fj(p r-E p t)/n 
(2nh) 3 / 2 \ 


3 i (pr-E p t)/h 


-/(P pr) 


(12.46) 


which is the sum of the incoming wave packet plus a wave packet that travels 
radially outwards from the scattering centre. 
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12.3 Cross-sections and scattering experiments 

Children sometimes test their skill by taking turns to throw pebbles at a dis¬ 
tant target, perhaps a rock. If a pebble hits, it will bounce off in a different 
direction, whereas a pebble that misses will simply continue undisturbed. 
Each throw will not be repeated exactly, and after a long time we might 
imagine that the children have thrown pebbles randomly, such that the dis¬ 
tribution of throws per unit area is uniform over a region surrounding the 
target. If so, we can estimate the area of the target that the children see by 
simply counting the number of pebbles that hit it if N[ n pebbles are thrown 
in per unit area, and N sc of them hit the rock, the rock has cross-sectional 
area 

A ~ N sc /N in . (12.47) 

With more care, we can measure the angle through which throws are de¬ 
flected. Pebbles that strike nearby points of a smooth rock will bounce off 
in roughly the same direction, whereas a jagged rock may deflect pebbles 
that hit closely spaced points very differently. Hence, counting the number 
of pebbles that end up going in a given direction gives us information about 
the rock’s shape. We define the differential cross-section do to be the 
area of the target that deflects pebbles into a small solid angle SSI. If there 
are N(9 , (f>)SSl such pebbles, then 


. N(9,<f>)SSl 

da ~^v- - 

1 v in 


So N{9,</>) 

sn - N in 


and the total cross-section is 


^tot 




dfl 


N (M) 

iVin 


Nsc 

N ir 


(12.48) 


(12.49) 


as above. 

This may seem a rather baroque manner in which to investigate rocks, 
but when you go out on a dark night with a torch, you probe objects in a very 
similar way by throwing photons at them. A more complete analogy can be 
drawn between pebble-throwing children and physicists with particle accel¬ 
erators: a beam containing a large number N\ y of particles is fired towards 
a target, and detectors measure the number of particles that scatter off into 
each element of solid angle 5Q. Long before the collision, a typical particle 
in the beam looks like a free state \<j>), so the probability density of each 
particle is |(x|(/>}| 2 and the number of particles per unit area perpendicular 
to the beam direction is 

Uin(xj_) = N h J dx|| I(x|<(>}| 2 , (12.50) 

where the integral is along the beam direction. 

When \(j>) is expanded in terms of momentum eigenstates, equation 
(12.50) becomes 

Mxi) = ^3 J dZ||d 3 pd 3 pV( p - p '> X /^(p)<f (p') 

= J d 3 pd 3 p' 5(p\\ 

(12.51) 

where the integral over arii produced the delta function of momentum along 
the beam direction. Experimental beams are highly collimated, so </>(p) 
vanishes rapidly unless the momentum is near some average value p. In 
particular, they contain only small amounts of momentum perpendicular to 
the beam direction p = p/|p|. Consequently, throughout a region of non- 
negligible extent near the centre of the beam, at x_l = 0, we have e lpj -' Xi / R ~ 
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1. With this approximation, the number of particles incident per unit area 
perpendicular to the beam is uniform near the beam centre, so 

n in (xjJ ~ j d 3 pd 3 p'd(py -p||)0(p)0*(p')- (12.52) 


Equation (12.52) may seem a bizarre way to rewrite the intuitively clear 
expression (12.50), but it will soon prove its worth. 

We must now calculate N{9,<j>). At large distances, we know that the 
wavefunction of particles scattered from the state | <f>) is (r\T\(j)). If we had 
placed detectors at some large distance ro from the scattering centre, over 
time they would have detected any particle that has the same values of 9 , q i 
and is predicted to lie at r > ro- Thus the total number of particles that are 
detected in the element of solid angle Sfl is 

POO 

N(9,(/))6n= / dr r 2 7V b <5fl |(r|T|</>)| 2 . (12.53) 

J ro 

Equation (12.46) gives (r|</>} = (r|(1 + T)|0), so 


N(6, <fi)6n = N h Sfl / dr 


’ d 3 p^(p) ipr/a , 

(2tt^) 3/2 6 /lP 


' pr) 


(12.54) 


So long as the scattering amplitude is reasonably smooth, the collimation 
of the beam allows us to replace /(p pi) by its value at the average 
momentum /(p —> pi), which gives 


N(9,(/))Sn = * b dn|/(p->pr)|» dre l(p P ' )r/R . 

(12.55) 

Explicitly writing the incoming particle’s momentum in terms of the average 
momentum p of the beam and its deviation dp from this value, we find 


P = \/(p + dp) • (p + dp) 


P 



p_dp\ 

P 2 ) 


= p + p-dp, 


(12.56) 


so the argument of the exponential in equation (12.55) involves 

{p-p') ~ (dp - dp') • p = (dpy - dpj|) = (p|| -pf|). (12.57) 

Since r > ro is very large, the phase of this exponential oscillates rapidly, so 
again the integral is dominated by contributions for which py = pj| giving 

N(9, cj))Sfl ~ IVb dll|/(p -> pi)\ 2 J ^^<KpM*(pO%|| -p'\\) ^ 125g , 

= «in|/(p-^pf)| 2 dO, 


where we have used equation (12.52). 

Combining this with the definition (12.48) of the differential cross-section, 
we find dcr/dll for scattering from momentum p (now relabelled) into a dif¬ 
ferent momentum p' of the same magnitude: 5 

^ = I/(P^P')| 2 , (12-59) 

where p' points towards the centre of the element of solid angle df2. The 
total scattering cross-section is 

0tot = /dfll/Cp^pOl 2 - (12.60) 

5 Our language here is loose: neither the incoming nor the outgoing states are strictly 
states of well-defined momentum. 
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The two remarkably simple formulae (12.59) and (12.60) form crucial 
links between experiment and theory. If the scattering potential is sufficiently 
weak that the Born approximation is valid, equation (12.32) tells us that the 
scattering amplitude is /(p —> p') = —47r 2 fi.m(p'|V|p), and the differential 
cross-section is 


der 

an 


(47t 2 Km) 2 1 (p / 1141 p) | 2 




(12.61) 


where q = p' p. The integral in equation (12.61) is just the Fourier 
transform V (q) of the potential, so the equation can be rewritten 


der 

dfi 


m 


4:TT 2 h‘ 


-P(Q ), 


(12.62) 


where P(q) = |F(q)| 2 is the power spectrum of V (x). Thus, by measuring 
the number of particles that are scattered into a given direction, we can 
determine the power spectrum of the interaction potential. 

If we could complement this information by measuring the phases of the 
Fourier transform, we could reconstruct V (x) from the scattering data. The 
obvious way to measure the phases is to observe interference between the 
scattered and incident amplitudes - interference of this type is what gener¬ 
ates holograms, from which the three-dimensional structure of the scattering 
object can be reconstructed. A high-energy accelerator does not produce 
sufficiently pure quantum states (in the sense of §6.3) for interference be¬ 
tween the incident and scattered amplitudes to be observable. Moreover, in 
realistic circumstances, experiments in which this interference was observed 
would be of limited interest because in reality the potential V (x) fluctuates 
in time. For example, in §12.4 below we discuss scattering of electrons by 
atoms, and in this case the electrostatic potential V varies in time as the 
electrons that partly generate it whizz about the atom. These internal mo¬ 
tions cause rapid variability in the phases of V (q) , while affecting the power 
spectrum of V to a much smaller extent: the latter depends on the number 
and structure of the lumps associated with the electrons and nucleus, rather 
than on their locations. Thus scattering experiments enable us to unveil as 
much of the structure of matter as we are in practice interested in. For this 
reason they are one of the most powerful tools we can deploy in our efforts 
to understand nature. 


12.3.1 The optical theorem 

The simple connection between the power spectrum of V (x) and the scatter¬ 
ing cross-section established above relies on the Born approximation. This 
approximation is certainly not always valid, so it is interesting to see what 
we can say about cross-sections in general. 

From equation (12.8) we have that 

T + T^ = = - Jd 3 p"T^\p"){p"\T. (12.63) 


Squeezing this equation between (p'| and |p) and using equation (12.39), we 
find that the scattering amplitude f(p—>p') satisfies 

6(E P - E p ,){f(p p') - /*(p' p)} 

= J d 3 P "S(E P „ - E p ,)f( p' p") S(E P " - E p )f (p p"). 

(12.64) 



270 


Chapter 12: Scattering Theory 


10 


i 


rO 

1 0.1 
c; 

TJ 

\ 

b 

T5 

0.01 


0.001 


0 50 100 

e 

Figure 12.2 The differential cross-section for neutron-proton scattering at two values of 
the centre-of-mass energy. Data obtained from M. Kreisler et. al., Phys. Rev. Lett., 16, 
1217, (1966). The diffraction peak at 6 = 0 can be understood in terms of the optical 
theorem. 



The second delta function in the integral ensures that E v " = E p , so we can 
replace the first delta function by S(E p — E p i), and then bring it outside the 
integral since it no longer depends on p". Then we have 


/( P -> P')-/*(P' ->P) = jd?p" 8(E p „-E p )f*(p' p")/(p -> p"). 

(12.65) 

When p' = p (equal directions as well as magnitudes), the left side becomes 
/(p —> p) — /*(p —> p) = 2i3m/(p —> p) and, after changing variables in the 
delta function to obtain S(E P » — E p ) = (m/p") 8(p" — p), equation (12.65) 
reduces to 


3m/(p ->• p) 


^ J d 3 p" ^jS(p-p")f*(p p")/(P -t P") 


( 12 . 66 ) 


In this last expression we recognise from equation (12.60) the total cross- 
section for scattering from a state of initial momentum p. We have derived 
the relation 

Att?) 

u to t(p) =-3m/(p -> p). (12.67) 

P 

This equation is known as the optical theorem, and relates the total cross- 
section to the imaginary part of the scattering amplitude in the forward 
direction. It is at heart a re-expression of equation (12.9) with an identity 
operator Y IV’iXV’il inserted after on the right. The forward scattering 
gives the probability a particle is removed from the original beam, and this is 
associated with the total probability the particle is deflected into some other 
direction. 

When neutrons are scattered from protons, the differential cross-section 
has a peak in the forward direction, as shown in Figure 12.2. As the centre- 
of-mass energy is raised, this peak increases in height and decreases in width. 
This behaviour is explained by the optical theorem. Experimentally, the total 
cross-section becomes roughly constant as p —> oo. Equation (12.67) then 
implies that 5m/(p — > p) rises roughly in proportion to p, so from equation 
(12.59) the differential cross-section in the forward direction grows at least 
as fast as p 2 . Conversely, since |/(p —> p')| 2 is necessarily positive, the total 
cross section <7 to t = fdfi |/(p — > p')| 2 is never less than the cross-section 
for scattering into any solid angle Af l < dn. Choosing Af l to be the region 
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around the forward direction in which |/(p — > p')| 2 is falling from its peak 
at p' = p, but still greater than i|/(p —p)| 2 , gives 

0tot > [ dft |/(p ->• p')| 2 > |An|/(p ->• p)| 2 > ±Afi|Sm/(p ->• p)| 2 . 

J An 

( 12 . 68 ) 

Hence, from the optical theorem, the fwhm of the peak around the forward 
direction is bounded by 

32tt 2 

AQ < 4^— (12.69) 

P 2 cr tot 

and therefore shrinks as p~ 2 as p —> oo. This diffraction peak is familiar 
from optics: collimated light can be diffracted by two slits, and the resulting 
intensity in the Fraunhofer region is peaked in the forward direction, with a 
fwhm that shrinks as the frequency of the light is increased. 


12.4 Scattering electrons off hydrogen 

We now apply our scattering formalism to a physical problem, namely scat¬ 
tering of electrons by a hydrogen atom that is in its ground state j 1,0, 0) 
(§8.1). Taking the proton to be a pointlike object at the centre of the atom, 
the atom’s charge distribution is 

p( r ) = e<5 3 (r) — e |(r|l, 0,0)| 2 . (12.70) 

From §8.1.2 we have that |(r11, 0,0)| 2 = e _2r / a ° /ttcIq where ao is the Bohr 
radius (eq. 8.15b). Hence, the atom is the source of an electric field E = 
—V$, where 


$(r) = 


1 


3 / P(r') 


47re 0 

e 


d 6 r‘ 


p -2r'/o 0 

3„/ e 


r — r 


dV 


2e 


47reo r 47raQeo 


clr'df? 


47reor 47reo7rag 

r' 2 sin0e _2r, /°° 


r — r 


(12.71) 


(r 2 + r' 2 — 2 rr' cos 9) 1 / 2 


The integral differs only trivially from that evaluated in Box 10.1. Adapting 
the result obtained there we conclude that 


$(r) = 


47T60 


- + — ) e~ 2r/ao . 
r a 0 


(12.72) 


Notice how the ground-state electron shields the pure 1 jr Coulomb potential 
of the proton, causing the overall potential to decline exponentially at large 
distances. This potential will scatter a passing charged particle such as an 
electron. It will turn out that our calculations only apply to electrons that 
have enough energy to excite or even ionise the atom. Never the less, we 
shall consider only the case of elastic scattering, in which the atom remains 
throughout in its ground state. 

Equation (12.61) gives the Born approximation for the differential cross 
section in terms of the Fourier transform of the interaction potential V (r) = 
—e < f>(r). By equation (12.72) V is a function of distance r only, and for any 
such function it is straightforward to show that 


j d 3 re iq ' r / ? W(r) = — J drrsin V(r). 
Substituting for V(r) = — e$(r) from equation (12.72) we find 
J d 3 r e-iq-r /n v ^ = ^n 2 8 + {qao/hf 


m e (4 + {qao/K) 2 ) 2 ' 


(12.73) 


(12.74) 



272 


Chapter 12: Scattering Theory 



Figure 12.3 Trigonometry of the 
isosceles triangle tells us the mag¬ 
nitude of momentum transfer, since 

Ip'I = IpI- 


Plugging this result into equation (12.61), we have finally 

da _ 2 ( 8 + (gao/71) 2 \ 2 

do a ° y(4+(qa 0 /Tiy-yJ ■ 


(12.75) 


Now g = |p 7 — p| = 2psin(0/2) (see Figure 12.3), so q is smallest and the 
cross-section is greatest for forward scattering (0 = 0). Quantitatively, 


der 


dft 


8=0 


-'■ 0 ; 


(12.76) 


independent of the incoming electron’s energy. When the electron’s mo¬ 
mentum is large, the cross-section drops sharply as we move away from the 
forward direction. This behaviour is in rough agreement with the optical 
theorem, although we should not expect equation (12.67) to hold exactly 
because we have used the Born approximation. 

We now check the validity of the Born approximation. The potential 
of equation (12.72) has a characteristic range ao. When an electron with 
momentum ~ p is aimed at the atom, it is within this range for a time of 
order St ~ ao m/p. Averaged over that time, the potential it experiences is 
of order 


V = 


drr 2 V(r) = — 


87re 0 a 0 


= - 71 , 


(12.77) 


where we have used the definition (8.15b) of ao and 1Z is the Rydberg constant 
(eq. 8.26). From the tdse the fractional change that V effects in its ket 
during this interval is of order S\ip)/\^j) ~ VSt/h. We expect the Born 
approximation to be is valid if this fractional change is small, that is, provided 


aoTO \V\ \/lZm e /2 

p Ti p 


Hence the inequality holds for electrons with energies 

&. » 


(12.78) 


(12.79) 


Since 1Z ~ 13.6 eV, while the rest-mass energy of the electron is m e c 2 ~ 
511 keV, there is a wide range of energy that is high enough for the Born 
approximation to be valid, yet small enough for the electron to be non- 
relativistic. In Figure 12.4 we plot the experimentally measured differential 
cross section alongside our estimate (12.75) from the Born approximation 
for three electron energies: 4.9, 30 and 680 eV. At the lowest energy the 
Born approximation is useless. At 30 eV ~ 27 Z the approximation works 
moderately well for back-scattering but seriously underpredicts the cross 
section for forward scattering. At 680 eV the approximation works well for 
all scattering angles. 
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Figure 12.4 Elastic e _ H scattering at electron kinetic energies E = 4.9, 30 and 680eV. 
The curves show the predictions of the Born approximation (eq. 12.75) while the points 
show experimental data from J.F. Williams, J. Phys. B, 8 no. 13 (1975). The accuracy of 
the Born approximation increases with energy. 


12.5 Partial wave expansions 

In §12.2 we introduced the S-matrix by squeezing the scattering operator 
between states |p) of definite momentum. This allowed us to evaluate the 
action of the free evolution operators Uk(t), because { |p}} is a complete set 
of eigenstates of Hk■ In §7.2.5 we saw that states | E, l, m) of definite angular 
momentum also form a complete set of eigenstates of Hk, so we could just 
as well consider the matrix (E', l', m'\S\E, l, to). 

From equation (12.37) we have that [Hk, 5] = 0, from which it fol¬ 
lows that {E 1 , l', m'\S\E, l, m) vanishes unless E’ = E. Also, if the scatter¬ 
ing potential is spherically symmetric, it follows from the work of §4.2 that 
[L,«S] = 0, so 


[L Z ,S] = 0 ; [L ± ,S}= 0 ; [L 2 ,S} = 0. (12.80) 

From the first and last of these commutators it follows that (E 1 , l',m'\S\E,l, to) 
vanishes unless in’ = to and V = l. Moreover, the second commutator implies 
that 6 


0 = {E,l, m\ [S, L + ]\E,l, m- 1) (12 81) 

oc (E, l, to|<S|.E, l, to) — (E, l, m — 1|<S|E, l,m — 1), 

so not only is S diagonal in the \E, l,m) basis, but (E, l, m\S\E, l, to) is 
actually independent of m. We can summarise these constraints on the S- 
rnatrix of a spherically symmetric potential by writing 

(E',l',m'\S\E,l,m) = 6{E - E')6 vl S mlmSl (E), (12.82) 

where si(E) is a number that depends on E and l. Finally, since the S-matrix 
is unitary, Si(E), must have unit modulus, so 

{E',l',m'\S\E,l,m) = S{E - E')6 vl 6 mlm e 2i5 ^ E \ (12.83) 

where all the remaining information is contained in the real phase shifts 
6i (E). This reduction of the whole scattering process to a mere set of phases 
makes the angular-momentum basis invaluable for scattering problems. 
Equation (12.83) implies that 

S\E,l,m) =e 2i5l{E) \E,l,m), (12.84) 


Recall from equations (7.15) that a_|_(m — 1) = ot—(m). 
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Box 12.1: The amplitude (p| E,l,m) 

We require the amplitude (p\E, l , in) that a particle that has well-defined 
energy E and angular momentum will be found to have momentum p 
and energy p 2 /2M. Since the quantity we seek is the momentum-space 
wavefunction of a particle of well-defined angular momentum, we simply 
repeat the work of §7.2.3 in the momentum representation. We have 

L, = l(p,x-p,y)=i (p^-P,£), 

where the operators are written in the momentum representation. We 
now introduce polar coordinates (p, p) for momentum space, and, in 
exact analogy to the derivation of equation (7.43), show that L z = i d/dp. 
This is simply minus the corresponding real-space result (7.43). It is easy 
to see that proceeding in this way we would obtain momentum-space 
representations of L± that differ from their real-space analogues (7.52) 
only in the substitutions 6 —> i9 and cf> —> ip and an overall change of 
sign. Consequently the momentum-space wavefunction of a state of well 
defined angular momentum must be p) = g(p)Y™*($, <p), where g(p) 
is as yet undetermined and the complex conjugate spherical harmonic is 
required because L z = +\d/dg>. If we require E to equal p 2 /2M, it is 
clear that g = GS(E — p 2 /2M). The constant of proportionality, G, is 
determined by the normalization condition 

S(E — E') = (E, l , m\E' 1 1 , Im) 

= G 2 J dpp 2 5(E - p 2 /2M)6(E' - p 2 /2M) J dfI|Y ; m | 2 

= G 2 M J d E p yj2ME p 6{E - E P )S(E' - E p ) 

= G 2 MV2ME5(E - E') = G 2 Mp5(E - E'). 

Thus G = ( Mp )" 1 / 2 . 


so if prior to scattering the particle is in the state | E, l, to), it will emerge from 
the scattering region in a state that differs only by the acquisition of an extra 
phase 2Si(E). This fact mirrors our finding in §5.3 that a one-dinrensional 
scattering process is entirely determined by the phase shifts of the even- and 
odd-parity solutions to the tise - which are the one-dinrensional analogues 
of states of well-defined angular momentum. 

The above discussion generalises straightforwardly to the case of parti¬ 
cles with non-zero spin, provided we replace the orbital angular momentum 
operator L with the total angular momentum operator J, and relabel the 
states and phase shifts accordingly. For simplicity, we will confine ourselves 
to scalar particles for the rest of this section. 

To relate the S-nratrix in the form of equation (12.83) to experimental 
cross-sections, we must calculate the scattering amplitude /(p —)• p'). Using 
equations (12.6) and (12.83) we obtain 

<p'|T|p>= ]T JdEdE' (p'\E\l , ,m , )(E\l',m'\T\E,l,m)(E,l,m\p) 

I'lm'm 

= J dE ( p'\ E ,l,m)(E,l,m\p){e 2lSl{E) - 1). 


(12.85) 


In Box 12.1 we show that 


<P| E,l,m) = (Mp)- 1/2 S(E - E P )YT*(0,?), 


where ($, </?) are the polar coordinates of p. 


( 12 . 86 ) 
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If we align the z-axis with the beam direction, the initial state is un¬ 
changed by rotations around the 2 -axis. Equation (7.38) with a = z then 
tells that the initial state has m = 0, so {pz\E,l,m) vanishes unless to = 0. 
Thus for a beam in this direction 


(EJ,m\p) = 6{E J M - E) 6 m0 Y?(0). (12.87) 

In §7.2.3 we saw that Y°(d) is a real ^ th -order polynomial in cos'd: 


Y ?(t?) 


21 + 1 
4-7T 


P;(cOS$). 


( 12 . 88 ) 


We have, moreover, that P;(l) = 1. Consequently, equation (12.87) yields 


(E,l,m\p) = 5(E p 


E)S m o i 


I 21 + 1 

AirMp 


(12.89) 


Using equation (12.86) again to eliminate (p'| E,l,m) from equation (12.85), 
and then using equation (12.88) to eliminate Y°, we obtain 


(p'|T|p) = ^ f dE {p'\E, l, 0)(E, l, 0|p) (e 2i<5i(E p) - l) 

i •> 

= E | wIwi 5{E ~ E ’' )S{E - ~ - 1 ) 


hj s( - E p - E »')E^r SP ‘( cos ' , ')( e2 “ ,(1! ' -h 


2t xhM 


(12.90) 

where •&' is the angle between p and p'. Comparing this equation with the 
definition (12.39) of the scattering amplitude, we see finally that 


/(P ->* p') = ^(2Z + l)Pi(costi')fi(E p ), (12.91a) 

i 

where the partial-wave amplitude is defined to be 

p2i<5j (.£7) _ 1 % 

fi(E) = n ---= - e i5 ‘^ sin Si{E). (12.91b) 

2ip p 

The differential cross-section is just the mod square of the scattering 
amplitude, and because the spherical harmonics are orthonormal when inte¬ 
grated over all angles, the total cross-section is 


Otot = /d,|/(p^p ')| 2 

= 4tt^ y/(2V + l)(2l + l)MErf t {E) j dfiY°(0')Y?(0') (12g2) 

l’l J 

= 4tt^ 2(21 + l)\fi(E)\ 2 = 4tiTi 2 21 + 2 1 sin 2 Si(E). 

i i p 

This equation is often written as Otot = where the partial cross- 

section of order l, 


<Ji = 47r(2/ + 1) \fi(E )| 2 = dirTi 2 ^-^- sin 2 Si(E), 

pz 


(12.93) 
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is the cross-section for scattering a particle that has total squared angular 
momentum l(l + l)ti 2 . Clearly, the partial cross-sections are restricted by 

0 < cri < AttTi 2 —, (12.94) 

pZ 

with cr i only vanishing when the phase shift Si = nir. Notice from equations 
(12.91a) that 


2/ _i_ i 

9fm/(p-^p) = y'(2J + l)P,(l)9fm/,(£;) = y'- h sin 2 Si(E). (12.95) 

l l ‘ 

Comparison of this with equation (12.92) shows that the optical theorem 
(12.67) is explicitly satisfied in this basis. This fact follows from conservation 
of angular momentum - we have treated the incoming beam as a superpo¬ 
sition of states of well-defined angular momentum; since the potential is 
spherically symmetric, it cannot change the particle’s angular momentum, 
so each state of well-defined angular momentum scatters separately, and does 
so in conformity with the optical theorem. 

In the classical picture of scattering, the angular momentum of a particle 
of energy p 2 /2M is determined by its impact parameter b , which is the 
distance between the scattering centre and the straight line tangent to the 
incoming trajectory. Quantitatively, the angular momentum has magnitude 
L = bp. Large b corresponds to a glancing collision and a small scattering 
angle, while at small b the encounter is nearly head-on and the particle is 
liable to back-scatter. Thus we expect the differential cross section /;(p —> 
p') to be largest for •&' ss 0 when l is large, and for •&' ss 7r to be largest when 
l « 0. The partial cross section cr; is expected to decrease as l, and therefore 
6, increases. 


12.5.1 Scattering at low energy 

At low energy, p is small and for l > 0 the classical impact parameter b = L/p 
becomes large. Hence we expect low-energy scattering to be dominated by 
the partial wave with 1 = 0. In this subsection we show that this naive 
expectation is borne out by our quantum-mechanical formulae. 

To discover how a given particle is actually scattered, we must relate 
the phase shifts Si (E) to the scattering potential V(r). Since the free state 
| E,l,m) is an eigenstate of Hk , L 2 and L z , from equation (7.70) it follows 
that 7 


\_d_ 

r 2 dr 


[f^ Y l E ’ Z ’ m >) 


1(1 + 1 ) 

j.2 


2mE\^ 

) 


(r| E,l,m). 


(12.96) 


When l ^ 0, the angular momentum term dominates the right side near 
the origin, and one can easily show that (r| E,l,m) ~ r l for small r (Prob¬ 
lem 12.8). Consequently, there is only a very small probability of finding a 
particle that has high angular momentum near the origin. This reasoning 
breaks down when the second term on the right side becomes important. For 
energy E = p 2 /2m this occurs at r ss 1%/p, which for large l coincides with 
the classical impact parameter. Suppose that the scattering potential acts 
over some characteristic length R , beyond which it is negligible - for exam¬ 
ple, in the case of the potential (12.72), R is of the order of a few Bohr radii. 
If R <C lh/p, then V(r) is only strong in a region where a free wavefunction 
would be very small. In this case, the I th partial wave will scarcely be af¬ 
fected, so Si ~ 0. Roughly, the only states that suffer significant scattering 
are those with angular momenta in the range lh < pR , so for low incoming 

7 In fact, (r| E,l,m) = jt(kr)Y™(9, <p), where k = y/2ME/h and ji is the I th spherical 
Bessel function, but we do not need this result here (see Problem 12.8). 
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momenta the total scattering amplitude can be well approximated by the 
first few terms in the infinite sum (12.91a). 

In fact, we can see quite generally that only the lowest l states make 
significant contributions to the low-energy cross-section. The scattering am¬ 
plitude /( p —> p') should be a smooth function of p and p' as p. p' —> 0, 
because at low energies the incoming particle has a large wavelength and can¬ 
not resolve any sharp features in the potential. Since in equation (12.91a) 
P; is an l th -order polynomial in cost? = p' • p /p'p, we see that if /( p —> p') 
is to be an analytic function of the Cartesian components of p and p' at low 
energies, the partial wave amplitude fi{E) must vanish with p at least as 
fast as 

lim fi(E) ~ (p'p) 1 = p 21 . (12.97) 

p—¥ 0 

The total cross section (12.92) then behaves as 

lim <Ttot = 47 t a 2 p 41 (12.98) 

i 

in terms of some constants a;, and can be well approximated by just the 
lowest few terms. In the extreme low energy limit, the only non-vanishing 
amplitude is fo(E p —> 0) = oo and the differential cross section 

ft S" a ° Po = a ° (12 -" } 


is isotropic. 

An eigenstate of the true Hamiltonian H with the same energy and 
angular-momentum quantum numbers as | E, l, m) has a radial wavefunction 
ui(r) that satisfies a version of equation (12.96) modified by the inclusion of 
a potential V(r). Writing ui(r) = Ui(r)/r we find 


d 2 H;(r) 

dr 2 


2 m E 

n 2 


Ui^ + VMr), 


(12.100a) 


where 


Kff(r) 


2 mV(r) : l{l + 1) 
h 2 + r 2 


(12.100b) 


For a general potential, we typically have to solve this equation numeri¬ 
cally, and then find the phase shifts by comparing our solution with equation 
(12.45) in the large r limit. However, we can obtain a heuristic understanding 
of the behaviour of the phase shift as follows. If the potential is attractive 
{V(r) < 0), Ui{r) will have a greater curvature, and hence oscillate more 
rapidly in the presence of V that it would have done if the particle were free. 
A potential with finite range R <C ITi/p only acts over a small part of a radial 
oscillation, so when V < 0, the wavefunction emerges from the interaction 
region slightly further along its cycle than a free wavefunction. On the other 
hand, when the potential is repulsive, Ui(r) has smaller curvature, so oscil¬ 
lates more slowly than a free wavefunction, emerging from the interaction 
region slightly behind. Equation (12.84) tells us that states emerge from the 
scattering process changed only in phase; we now see that the sign of the 
phase shift 5i will typically be opposite to that of the potential. 

As the magnitude of V increases, so does the difference in oscillation 
rates between interacting and free eigenstates, and hence at fixed energy 
\5i(E)\ likewise increases. When the potential is sufficiently strong, the in¬ 
teracting wavefunction can oscillate precisely half a cycle more (or less if 
V > 0) in the interaction region than it would do if the state were free, and 
then \5i(E)\ = n with the consequence that fi(E) 0. In these circum¬ 
stances this angular-momentum state suffers no scattering at all. 

In §10.3, we saw that atoms of a noble gas such as argon are chemically 
inert because in their ground states they have spherically-symmetric distri¬ 
butions of electron charge, and they have no low-lying excited states that 
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Figure 12.5 An exponential attrac¬ 
tive force combined with centrifugal 
repulsion generates a minimum in 
the effective potential. 


can be mixed in by a perturbation to generate a less symmetrical charge dis- 
tribution. As a consequence, these atoms generate negligible electric fields 
beyond some limiting radius R ~ ao that contains nearly all the probability 
density of the outermost shell of electrons. At r < R there is a significant 
electric field, and any particle that penetrates this region will be appreciably 
scattered, but particles that have larger impact parameters will be negligi¬ 
bly scattered. At energies low enough that Rp <C h, scattering from states 
with l > 0 is negligible, while the considerations of the previous paragraph 
suggest that there could be an energy at which there is also no scattering 
from states with l = 0. Then an electron is not scattered at all. Exactly 
this Ramsauer—Townsend effect was observed before the development of 
quantum mechanics 8 when electrons of energy ~ 0.7 eV were discovered to 
pass undeflected past noble-gas atoms. 


12.6 Resonant scattering 

Nuclear physics involves a combination of short- and long-range forces: the 
strong interaction that binds protons and neutrons into nuclei has only a 
short range, while the electrostatic repulsion between protons has a long 
range. Figure 12.5 illustrates the fact that when a short-range attractive 
force is combined with a long-range repulsive force, the overall effective po¬ 
tential V e f{ of equation (12.100b) is likely to have a local minimum. In §5.3.3 
we studied scattering by a one-dimensional potential that contained such a 
potential well and demonstrated that a plot of the scattering cross-section 
versus energy would show sharp peaks at the energies that allow the particle 
to be trapped in the well for an extended period of time. The method of 
partial waves allows us to consider the physics of temporary entrapment in 
the much more realistic case of scattering in three dimensions. We shall find 
not only that the results of §5.3.3 largely carry over to realistic scattering 
potentials, but we are able to extend them to include a quantitative account 
of the delay between the particle reaching the potential well and its mak¬ 
ing good its escape. Physicists have learnt much of what is known about 
the structure of both nuclei and baryons such as protons and neutrons by 
exploiting the connections between bound states and anomalous scattering 
cross sections that emerges from this section. 

Equation (12.46) gives the wavefunction at late times of a particle that 
was initially in the free state |</>). It breaks the wavefunction into two parts. 
The first is a sum of plane waves </>(p)e 1< ' p ' r ~~ Et )/ ?i . If |0(p)| 2 has a well-defined 
peak at momentum p, from §2.3.3 we know that the amplitude of this first 
contribution peaks on the plane p • r = pt/m. To determine the location at 

8 C. Ramsauer, Ann. Physik, 4, 64 (1921); V.A. Bailey & J.S. Townsend, Phil. Mag., 
42. 873 (1921). 
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which the amplitude of the second part peaks, we observe that as t —> oo, 
the phase of the exponential in equation (12.46) varies extremely rapidly 
as a function of momentum, causing the sign of the integrand to oscillate 
quickly. If 4>(p) is a smooth function, the integral will be dominated by the 
contribution from momenta at which the phase is stationary. To find these 
points, we must take into account the phase of /(p —» pr). By equations 
(12.91), the scattering amplitude for each partial wave is real except for 
a factor e I<5i ^ Bp - ) . Hence the dominant contribution to the second term in 
equation (12.46) arises when 

O r\ 

— {pr — E p t + hSi(E p )} = 0 i.e. when r =—t — Ti—5i(E p ). (12.101) 
op m op 

We see that if the phase shift 8i(E p ) increases sharply for momenta near the 
average momentum of the initial state, the amplitude of the wave will be 
concentrated at a smaller radius than the incoming wave would have been. 
Consequently, there will be two distinct peaks in the probability of a particle 
reaching a detector at some distance from the scattering centre. The first 
is associated with the possibility that the particle misses its target, and the 
second with the possibility that it hits and is temporarily trapped by it 
before making good its escape. Thus unstable bound states are associated 
with rapid increases with E p in the phase shift of the scattered particle. This 
conclusion mirrors our finding in §5.3.3 that temporary trapping of particles 
by a one-dinrensional well is associated with rapid variations with energy in 
the phases </> and <f>' of the states of well-defined parity. 

We can model a dramatic increase of Si(E) by postulating that, for 
energy near some value Er, the phase shift behaves as 

^ tan" 1 (-^0 , (12.102) 


where the fixed energy scale T is included to ensure that the argument of the 
inverse tangent is dimensionless, and must be positive if we want dSi/dp > 0. 
In this model the phase shift rapidly increases by n as the energy increases 
through Er. Using the model in equation (12.101), we find that the time 
delay between the two peaks in the probability density of the particle hitting 
a given detector is 


. mh d . 

A t = — ^-5i{E) 
p Op 


mhdE d _ 1 ( —T/2 \ 
P dp dE tein \E-Er) 
hT/2 


(12.103) 


We infer from this delay that the lifetime of the quasi-bound state is ss h/T in 
agreement with the much less rigorous conclusion that we reached in §5.3.3. 

Calculating T for a physically realistic potential usually requires numer¬ 
ical analysis. However, since the lifetime of the quasi-bound state increases 
as T decreases, we anticipate that smaller values of T correspond to deeper 
minima in the potential: a deeper well traps the particle for longer. The 
limiting case T —> 0 + implies that the delay in emergence becomes infinite. 
We interpret this to mean that the dip in V is just deep enough to genuinely 
bind an incoming particle. 

If V is so deep that there is a state that has a strictly negative energy, 
the final state may not resemble the initial free state. For example, the 
incoming particle may get trapped for good in the potential, or it may knock 
out another particle that is already trapped (as in ionisation of an atom). 
In such cases, the scattering is said to be inelastic and the methods of this 
chapter must be extended 9 . 


9 The difficulty is not too severe. True bound states have energy E < 0 whereas all 
free states must have energy E > 0. Hence, if |b) is bound and | <f>) is free, (b|rp) = 0 so H 
acts on a larger Hilbert space than does Hr . Including these extra states carefully allows 
us to treat bound states. (See also Problem 12.6.) 
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Returning to the case T > 0, we now investigate how the cross-section is 
affected by the delayed emergence of our particle. From equations (12.6) and 
(12.46) the wavefunction of the scattered particle in the asymptotic future is 


(r|T|0;t) 


f d 3 p , e Kpr-Et)/h 

J (2^)3/2^ (p) r 


/(P -t£>r) 


l 


f d 3 p ,, e i(pr-Et)/h 

J (2 tt?).) 3 / 2< ^ P) r 




(12.104) 

where the second line uses equation (12.91a) to relate /( p —> p') to the 
partial-wave amplitudes To derive the cross-section (12.59), we as¬ 

sumed that the scattering amplitudes are more slowly varying functions of 
p than the wavepacket </>(p) - see the discussion after equation (12.54). In 
the presence of a resonance, this approximation may break down. Indeed, 
when one of the phase shifts 6i(E) has the form of equation (12.102), from 
equation (12.91b) the corresponding partial wave amplitude is 


fi(E) = -e i5 'W sin Si(E) 
P 


h r /2 
pE-E R + ir/2’ 


(12.105) 


and for small T this varies rapidly when E ss E R . We assume that the 
incoming wavepacket <f>( p) contains states |p) that are restricted in energy to 
a range of width A and consider first the case A <C T in which the resonance 
is broader than the uncertainty in the energy of the incoming particle. 


12.6.1 Breit-Wigner resonances 

If A <C r, then fi(E) varies slowly with energy in comparison to <j>( p), 
and equation (12.93) for the partial cross-sections 07 applies. Again using 
equation (12.102), we have (cf. eq. 5.64a) 


&i(E) = 47t( 2Z + l)|/j| 2 


Anh 2 ( 2 Z + l)(r/ 2 ) 2 
P 2 (E - E R ) 2 + (r/ 2) 2 ‘ 


(12.106) 


A peak in the cross-section that follows this famous formula is said arise 
from a pure Breit-Wigner resonance. Breit-Wigner resonances are eas¬ 
ily detected in plots of a cross-section versus energy and are the experimental 
signature of quasi-bound states in the scattering potential. Figure 12.6 is a 
plot of equation (12.106). Notice that the energy dependence is a combi¬ 
nation of the slow decline with E associated with the factor p~ 2 and the 
peak that arises from the Lorentzian final factor - such factors are familiar 
from the theory of a damped harmonic oscillator (Box 5.1). If T <C E R , 
the factor p~ 2 changes very little over the width of the bump, and the res¬ 
onance curve falls to half its maximum height when E ~ E R ± r/2. Thus 
the resonance lifetime Ti/T can be determined from the fwhm of the peak 
in the cross-section. This result explains why we needed to restrict ourselves 
to superpositions with A C T: in order to resolve the Breit-Wigner curve 
experimentally, there had better be a good chance that our particle’s energy 
lies within T of E R , where all the action lies. 


12.6.2 Radioactive decay 

The width T of a very long-lived resonance may be so small that our experi¬ 
mental apparatus cannot generate incoming particles with sufficiently small 
uncertainty A in the energy to resolve the curve of Figure 12.6. Then, using 
equation (12.86), we expand the momentum amplitudes 4>(p) of the initial 
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E/r 

Figure 12.6 The Breit-Wigner formula for a scattering cross section in the presence of a 
resonance. Here = 10T and the cross section is normalised by s = 2(2 1 + l)h?/mT. 


state as 


0 (p) = 5Z / d E' (p| E',l,m)(E',l,m 

lm 

= j2 Y r(p) 


lm 


, { E, l, m\q 

yJMp 


(12.107) 


where E = 


2 M 


Suppose for simplicity that, near some energy E R , only one partial wave 
amplitude has the form of equation (12.105), the others all being negligi¬ 
ble by comparison. Then, ignoring any angular dependence, the final-state 
wavefunction (12.104) of the scattered particle contains a factor 


(r|T|^;t)oc Jdp 


p 2 (E, l, m\4>) Th e Hpr-Et)/h 
(27 rh) 3 / 2 (Mp) 1 / 2 2^ E - E R + iT/2 


Th 

2r 


d E 


(27 vh) 3 / 2 V p 


M, 

— {E,l,m\ 


e i(pr-Et)/h 

' E ~ E R + iT/2~ 


(12.108) 


If the initial state \<j>) has average energy around {H K ) ~ E R , but is a super¬ 
position of states with different energies, smooth over a range A T, we may 
approximate p~ 1 ^ 2 (E, l, ?n|</>) by its value at resonance, p R 1 ^ 2 (E R , l, m\4 >} and 
bring it outside the integral. Similarly, near resonance we approximate p by 


, d P / P P n E — E R 

P -Pr + ~ E n> = PR 


VR 


where v R = p R /M, so 


gi (pr-Et)/n ^ e i(pRr-E R t)/h e i(E-E R )(r/v R -t)/h 

Substituting these approximations into equation (12.108), we find 


(12.109) 


( 12 . 110 ) 


(r|T|</>; t) 


Hi {E R ,l,m\<t>) e i (P Rr -- ERt )A e i(E-E R )(r/v K -t)/h 


2^ /2 { 2 ttH ) 3 / 2 


d E- 


E~E R + iT/2 ' 

( 12 . 111 ) 

The remaining integral can be done by contour integration. Since the denom¬ 
inator is large except near E = E R , we can extend the range of integration 
to —oo < E < oo, without drastically affecting the integral. If r > v R t, we 
close the contour in the upper half complex E plane. Since the only pole is 
in the lower half-plane, the integral evaluates to zero. If r < v R t, we close 
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the contour in the lower half-plane. Evaluating the residue of the pole at 
E = E r — ir/2, we conclude that 


(r|T|<M> 


ir 


2(27t?iur) 1 / 2 


(Er, l, 


cKPRr-E R t)/h 


-T(t-r/vn)/2h 


( 12 . 112 ) 


Consequently, 


l(r|r|^;t)| 2 


0 if r > Ur t 

T 2 \{ER,l,m\(p )\ 2 c _ r(t _ r/vn)/n 
8nhvR r 2 


(12.113) 

otherwise. 


The physical interpretation of this equation is the following. The probability 
density |(r|T|</>; t )\ 2 is zero before time t' = r/v r because the particle travels 
radially outwards at speed ~ vr. Subsequently, the probability of finding the 
particle anywhere on a sphere of radius r decays exponentially as e _r( - t ' t . 

This result provides a remarkable explanation of the law of radioactive 
decay: we interpret the emission of a neutron by an unstable nucleus as the 
endpoint of a scattering experiment that started months earlier in a nuclear 
reactor, where the nucleus was created by absorption of a neutron. More 
dramatic is the case of 238 U, which decays via emission of an a-particle to 
234 Th with a mean life Ti/Y ~ 6.4 Gyr. Because T/h is tiny, the probability 
(12.113) is nearly constant over huge periods of time. Our formalism tells 
us that if we were to scatter a-particles off 234 Th, they would all eventually 
re-emerge, but only after a delay that often exceeds the age of the universe! 
Thus 238 U is really a long-lived resonance of the (cc, 234 Th) system, rather 
than a stationary state. It is only because the timescale Ti/T is so long 
that we speak of 238 U rather than a resonance in the ( a , 234 Th) system. In 
fact, 234 Th is itself a resonance, ultimately of Pb. The longevity of 238 U is 
inevitably associated with a very small probability that 238 U will be formed 
when we shoot an cc-particle at a 234 Th nucleus. To see this notice that 
the final-state wavefunction (r|<S|</>;t) = (r|^>;t) + (r|T|</>;t), also involves an 
unscattered piece. On account of the smallness of T, the ratio of probabilities 


Prob(a is trapped) 
Prob(a unscattered) 


r 2 m 

hpR 


\(Er, l, m |</>)| 2 


(12.114) 


is extremely small. Hence it is exceptionally difficult to form 238 U by firing a- 
particles at 234 Th nuclei. Naturally occurring 238 U was formed in supernovae, 
where the flux of a-particles and neutrons was large enough to overcome this 
suppression. 


Problems 

12.1 Show that the operators 12± defined by equation (12.23) obey 


HYl±= YI±(Hk ± ie) =F ie. (12.115) 


12.2 Obtain the first and second order contributions to the S-matrix from 
the Feynman rules given in §12.3. 

12.3 Derive the Lippmann Schwinger equation 


l±> 


I E) + 


1 

E-H k ± ie 


V\±), 


(12.116) 


where |±) are in and out states of energy E and \E) is a free-particle state of 
the same energy. In the case that the potential V = Vo|x)(xl for some state 
|x) and constant Vo, solve the Lippmann-Schwinger equation to find (xl±)- 
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12.4 A certain potential V(r) falls as r~ n at large distances. Show that 
the Born approximation to the total cross-section is finite only if n > 2. Is 
this a problem with the Born approximation? 

12.5 Compute the differential cross section in the Born approximation for 
the potential U(r) = Vq exp(— r 2 /2ro). For what energies is the Born ap¬ 
proximation justified? 

12.6 When an electron scatters off an atom, the atom may be excited (or 
even ionised). Consider an electron scattering off a hydrogen atom. The 
Hamiltonian may be written as H = Hq + Hi where 


Hn = 


Pi 


p\ 


2m 4 ^ 60^1 2 to 


(12.117) 


is the Hamiltonian of the hydrogen atom (whose electron is described by co¬ 
ordinate ri) together with the kinetic Hamiltonian of the scattering electron 
(coordinate r 2 ), and 


—— ( --- 

47re 0 V l r i — r 2 1 r 2 ) 


(12.118) 


is the interaction of the scattering electron with the atom. 

By using Hq in the evolution operators, show that in the Born approx¬ 
imation the amplitude for a collision to scatter the electron from momen¬ 
tum p 2 to p 2 whilst exciting the atom from the state \n,l,m) to the state 
\n', l',m') is 


/(p 2 ; n, l, m -4 P ' 2 ;n',l',m') 

4 tt 2 ji r TY) r 

= - ( 27 r fi )3 J d 3 rid 3 r 2 e“ lq 2 ' I ’ 2 (?r / , l', m / |ri)(ri|n, l, m).ffi(ri, r 2 ), 

(12.119) 

where q 2 is the momentum transferred to the scattering electron. (Neglect 
the possibility that the two electrons exchange places. You may wish to 
perform the d 3 ri integral by including a factor e~ ari and then letting a —> 0 .) 

Compute the differential cross-section for the j 1, 0,0) -» |2,0, 0) transi¬ 
tion and show that at high energies it falls as cosec 12 ( 0 / 2 ). 

12.7 Use the optical theorem to show that the first Born approximation is 
not valid for forward scattering. 

12.8 A particle scatters off a hard sphere, described by the potential 


V(r) = { 


00 

0 


for |r| < a 
otherwise. 


( 12 . 120 ) 


By considering the form of the radial wavefunction u(r ) in the region r > a, 
show that the phase shifts are given by tan 5i = ji(ka)/ni(ka), where k = 
\PlrnE/h and ji(kr) and ni(kr) are spherical Bessel functions and Neumann 
functions, which are the two independent solutions of the second-order radial 
equation 


jLA 

r 2 dr 


r 2 -^«(r-) ) = 


1(1 + 1) 2 mE\ 

W J 


u(r). 


In the limit hr —?■ 0, show that these functions behave as 
(hr) 1 ... 21-1 


ji(kr) 


21 + 1 


ni(kr) 


(hr 


ii+1 


( 12 . 121 ) 


( 12 . 122 ) 


Use this to show that in the low-energy limit, the scattering is spherically 
symmetric and the total cross-section is four times the classical value. 
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Problems 


12.9 Show that in the Born approximation the phase shifts 61 (E) for scat¬ 
tering off a spherical potential V (r) are given by 

pOO 

Si(E) ~ —2mkh 2 / dr r 2 V(r) (ji(kr )) 2 . (12.123) 

Jo 


When is the approximation valid? 

12.10 Two a-particles collide. Show that when the a-particles initially 
have equal and opposite momenta, the differential cross-section is 

^ = \m+f(e-n)\ 2 . (12.124) 

Using the formula for f{9) in terms of partial waves, show that the differential 
cross-section at 0 = 7r/2 is twice what would be expected had the a-particles 
been distinguishable. 

A moving electron crashes into an electron that is initially at rest. As¬ 
suming both electrons are in the same spin state, show that the differential 
cross-section falls to zero at 6 = 7r/4. 
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Appendices 


Appendix A: The laws of probability 

Events are frequently one-offs: Pretty Lady will run in the 4.20 at Sandown Park 
only once this year, and if she enters the race next year, her form and the field will 
be different. The probability that we want is for this year’s race. Sometimes events 
can be repeated, however. For example, there is no obvious difference between one 
throw of a die and the next throw, so it makes sense to assume that the probability 
of throwing a 5 is the same on each throw. When events can be repeated in this 
way we seek to assign probabilities in such a way that when we make a very large 
number N of trials, the number ua of trials in which event A occurs (for example 
5 comes up) satisfies 

ua^paN. (A.l) 

In any sequence of throws, the ratio tia/N will vary with N, while the probability 
Pa does not. So the relation (A.l) is rarely an equality. The idea is that we should 
choose pa so that tia/N fluctuates in a smaller and smaller interval around pa as 
N is increased. 

Events can be logically combined to form composite events: if A is the event 
that a certain red die falls with 1 up, and B is the event that a white die falls 
with 5 up, AB is the event that when both dice are thrown, the red die shows 1 
and the white one shows 5. If the probability of A is pa and the probability of B 
is pb, then in a fraction ~ pa of throws of the two dice the red die will show 1, 
and in a fraction ~ ps of these throws, the white die will have 5 up. Hence the 
fraction of throws in which the event AB occurs is ~ PaPb so we should take the 
probability of AB to be pab = PaPb . In this example A and B are independent 
events because we see no reason why the number shown by the white die could 
be influenced by the number that happens to come up on the red one, and vice 
versa. The rule for combining the probabilities of independent events to get the 
probability of both events happening, is to multiply them: 

p(A and B) = p(A)p(B) (independent events). (A.2) 

Since only one number can come up on a die in a given throw, the event A 
above excludes the event C that the red die shows 2; A and C are exclusive events. 
The probability that either a 1 or a 2 will show is obtained by adding pa and pc- 
Thus, for classical probability 

p(A or C) = p(A) + p(C) (exclusive events). (A.3) 

In the case of reproducible events, this rule is clearly consistent with the principle 
that the fraction of trials in which either A or C occurs should be the sum of the 
fractions of the trials in which one or the other occurs. If we throw our die, the 
number that will come up is certainly one of 1, 2, 3, 4, 5 or 6. So by the rule just 
given, the sum of the probabilities associated with each of these numbers coming 
up has to be unity. Unless we know that the die is loaded, we assume that no 
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number is more likely to come up than another, so all six probabilities must be 
equal. Hence, they must all equal -L Generalising this example we have the rules 

N 

With just N mutually exclusive outcomes, J2 Pi = 1 ' (AA\ 

i= 1 ( * ) 

If all outcomes are equally likely, pi = 1/N. 

Expectation values A random variable a; is a quantity that we can measure 
and the value that we get is subject to uncertainty. Suppose for simplicity that 
only discrete values Xi can be measured. In the case of a die, for example, x could 
be the number that comes up, so x has six possible values, xi = 1 to X6 = 6. If pi 
is the probability that we shall measure Xi, then the expectation value of x is 

{x) = ^2,PiXi. (A.5) 

i 

If the event is reproducible, it is easy to show that the average of the values that 
we measure on N trials tends to ( x) as N becomes very large. Consequently, ( x) 
is often referred to as the average of x. 

Suppose we have two random variables, x and y. Let pij be the probability 
that our measurement returns Xi for the value of x and yj for the value of y. Then 
the expectation of the sum x + y is 

(x + y) = Y^ Pij (*i + yj) = Y^ Pij x i + PvVi ( A -6) 

ij ij ij 

But ^2 Pij is the probability that we measure Xi regardless of what we measure 
for y, so it must equal pi. Similarly JV p, : j = p : j , the probability of measuring yj 
irrespective of what we get for x. Inserting these expressions in to (A. 6) we find 

(x + y) = {x) + (y). (A.7) 

That is, the expectation value of the sum of two random variables is the sum of 
the variables’ individual expectation values, regardless of whether the variables are 
independent or not. 

A useful measure of the amount by which the value of a random variable 
fluctuates from trial to trial is the variance of x: 

((x - (x)) 2 ) = (x 2 ) -2{x (x)) + ((x) 2 ) , (A.8) 

where we have made use of equation (A.7). The expectation (x) is not a random 
variable, but has a definite value. Consequently (x (x)) = (x) 2 and ((x) 2 ) = (x ) 2 , 
so the variance of x is related to the expectations of x and x 2 by 

(A*) = {(x — (x)) 2 ) = (x 2 ) — (x) 2 . (A.9) 
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Vector notation is very powerful, but sometimes it is necessary to step outside 
it and work explicitly with the components of vectors. This is especially true in 
quantum mechanics, because when operators are in play we have less flexibility 
about the order in which we write symbols, and standard vector notation can be 
prescriptive about order. For example if we want p to operate on a but not b, 
we have to write b to the left of p and a on the right, but this requirement is 
incompatible with the vectorial requirements if the classical expression would be 
p x (a x b). The techniques of Cartesian tensors resolve this kind of problem. 
Even in classical physics tensor notation enables us to use concepts that cannot be 
handled by vectors. In particular, it extends effortlessly to spaces with more than 
three dimensions, such as spacetime, which vector notation does only in a limited 
way. 

Instead of writing a, we write cn for the j th component of the vector. Then 
a ■ b becomes JVdi&i. When a subscript is used twice in a product, as i is here, 
it is generally summed over and we speak of the subscript on a being contracted 
on the subscript on b. 

The ij th component of the 3x3 identity matrix is denoted S t j and called the 

Kronecker delta: so 


5 


1 if i = j 
0 otherwise. 


(B.l) 
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The equation m = ^ • Sijdj expresses the fact that the identity matrix times a 
equals a. The scalar product often appears in the form a ■ b = JT . SijCiibj. To see 
that this is equivalent to the usual expression, we do the sum over j. Then the 
delta vanishes except when j = i, when it is unity, and we are left with JT qibi. 
Notice that it does not matter in what order the symbols appear; we have also 
a • b = JTj UiSijbj, etc. - when using Cartesian tensors, the information that in 
vector notation is encoded in the positions of symbols is carried by the way the 
subscripts on different symbols are coupled together. 

To make the vector product we need to introduce the alternating symbol 
or Levi—Civita symbol tijk ■ This is a set of 27 zeros and ones defined such that 
ei 23 = 1 and the sign changes if any two indices are swapped. So £213 = —1, 
£231 = 1, etc. If we cyclically permute the indices, changing 123 into first 312 and 
then 231, we are swapping two pairs each time, so there are two cancelling changes 
of sign. That is, £123 = £312 = £231 = 1 and £213 = £321 = £132 = — 1. All the 
remaining 21 components of the alternating symbol vanish, because they have at 
least two subscripts equal, and swapping these equal subscripts we learn that this 
component is equal to minus itself, and therefore must be zero. 

We now have 

(a xb)i = ^ eijkCijbk- (71-2) 

jk 

To prove this statement, we explicitly evaluate the right side for i = 1,2 and 3. 
For example, setting i — 1 the right side becomes tijkdjbk- In this sum tijk 
is non-vanishing only when j is different from k and neither is equal one. So there 
are only two terms: 

£123(1263 + £132(1302 = <1263 — <J3&2 (B.3) 

which is by definition the third component of a x b. 

A few simple rules enable us to translate between vector and tensor notation. 

1. Fundamentally we are writing down the general component of some quantity, 
so if that quantity is a vector, there should be one subscript that is “spare” 
in the sense that it is not contracted with another subscript. Similarly, if the 
quantity is a scalar, all indices should be contracted, while a tensor quantity 
has two spare indices. 

2. Each scalar product is expressed by choosing a letter that has not already 
been used for a subscript and making it the subscript of both the vectors 
involved in the product. 

3. Each vector product is expressed by choosing three letters, say i,j and k and 
using them as subscripts of an t. The second letter becomes the subscript 
that comes before the cross, and the third letter becomes the subscript of the 
vector that comes after the cross. 

We need a lemma to handle vector triple products: 


^ ' tijkt-irs 
i 


— Sjr&ks bkrbj 


(B.4) 


Before we prove this identity (which should be memorised), notice its shape: on 
the left we have two epsilons with a contracted subscript. On the right we have 
two products of deltas, the subscripts of which are matched “middle to middle, 
end to end” and “middle to end, end to middle”. Now the proof. For the sum on 
the left to be non-vanishing, both epsilons must be non-vanishing for some value 
of i. For that value of i, the subscripts j and k must take the values that i does 
not. For example, if i is 1, j and k must between them be 2 and 3. For the same 
reason r and s must also between them be 2 and 3. So either j = r and k = s 
or j = s and k = r. I 11 the first case, if ijk is an even permutation of 123, then 
so is irs, or if ijk is an odd permutation, then so is irs. Hence in the first case 
either both epsilons are equal to 1 , or they are both equal to —1 and their product 
is guaranteed to be 1. The first pair of deltas on the right cover this case. If, on 
the other hand, j = s and k = r, irs will be an odd permutation of 123 if ijk is 
an even one, and vice versa if ijk is an odd permutation. Hence in this case one 
epsilon is equal to 1 and the other is equal to —1 and their product is certainly 
equal to —1. The second product of deltas covers this case. This completes the 
proof of equation (B.4) because we have shown that the two sides always take the 
same value no matter what values we give to the subscripts. 

Besides enabling us to translate vector products into tensor notation, the 
alternating symbol enables us to form the determinant of any 3x3 matrix. I 11 fact, 
this is the symbol’s core role and its use for vector products is a spinoff from it. 
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The simplest expression for det(A) is 

det(A) — y^ j e i jkAuA 2 jA 3k . (B.5) 

ijk 

A more sophisticated expression that is often invaluable is 

det^Aje rs t — ^ -d r t. -d .sy -4 /h - (B.6) 

ijk 

These expressions are best treated as the definition of det(A) and used to derive 
the usual rule for the evaluation of a determinant by expansion down a row or 
column. This popular rule is actually a poor way to define a determinant, and a 
dreadful way of evaluating one. It should be avoided whenever possible. 


Appendix C: Fourier series and transforms 

The amplitude for a particle to be located at x and the amplitude for the particle 
to have momentum p are related by Fourier transforms, so they play a significant 
role in quantum mechanics. In this appendix we derive the basic formulae. Like 
Fourier 1 himself we start by considering a function of one variable, f(x), that is 
periodic with period L: that is, /(x + L) = f(x) for all x. We assume that / can 
be expressed as a sum of sinusoidal waves with wavelength L: 


/(*)= I] 


F n e 


2-Kinx / L 


(C.l) 


n= — oo 


where the F n are complex numbers to be determined. At this stage this is just 
a hypothesis - only 127 years after Fourier introduced his series did Stone 2 prove 
that the sum on the right always converges to the function on the left. However 
numerical experiments soon convince us that the hypothesis is valid because it is 
straightforward to determine what the coefficients F n must be, so we can evaluate 
them for some specimen functions / and see whether the series converges to the 
function. To determine the F n we simply multiply both sides of the equation by 
e- 2 -n-i mx/L an( j integrate from —Lj 2 to L/2: 3 


fL /2 


da:e 


— 27ri mx / L 


I-L/2 


/(*) = ]T 


n= — o o 


(■L/2 

/ dxe ^K n ~m)x/ L 

I-L/2 (C.2) 


= LF m 


where the second equality follows because for n A 171 the integral of the exponential 
on the right vanishes, so there is only one non-zero term in the series. Thus the 
expansion coefficients have to be 

1 (-r/2 

F m = ±r I d xe~™ mx/L f(x). 


I-L/2 


In terms of the wavenumbers of our waves, 

2nn 

IP 

our formulae become 


kn — 


/(*) = Y Fne ' k 


F — — 
L 


r l/2 


dxe 


-L/2 


7 (*) 


(C.3) 

(C.4) 

(C.5) 


At this stage it proves expedient to replace the F n with rescaled coefficients 

7(fc„) = LF n . (C.6) 

so our equations become 


f(x)= J2 


n= — oo 




f(km)= f dze lkmX f{x). (C.7) 

J-L/2 


1 After dropping out from a seminary Joseph Fourier (1768-1830) joined the Auxerre 
Revolutionary Committee. The Revolution’s fratricidal violence led to his arrest but he 
avoided the guillotine by virtue of Robespierre’s fall in 1794. He invented Fourier series 
while serving Napoleon as Prefect of Grenoble. His former teachers Laplace and Lagrange 
were not convinced. 

2 Marshall Stone strengthened a theorem proved by Karl Weierstrass in 1885. 

3 You can check that the integration can be over any interval of length L. We have 
chosen the interval (— ^L, ^L) for later convenience. 
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Now we eliminate L from the first equation in favour of the difference d k = fc n +i — 
k„ = 27 t/L and have 

00 c\k~ ~ r L t 2 

f(x) = 2 _ /( fc ») elfe ”' X ; f{km)= dxe~ lkmX f(x). (C.8) 

n =-00 n J-L /2 

Finally we imagine the period getting longer and longer without limit. As L grows 
the difference d k between successive values of k n becomes smaller and smaller, so 
k n becomes a continuous variable k, and the sum in the first equation of (C.8) 
becomes an integral. Hence in the limit of infinite L we are left with 

/ oo j7 _ _ roo 

g f(k)e ikx ; m=J dxe~ ikx f(x). (C.9) 

These are the basic formulae of Fourier transforms. The original restriction to 
periodic functions has been lifted because any function can be considered to repeat 
itself after an infinite interval. The only restriction on / for these formulae to be 
valid is that it vanishes sufficiently fast at infinity for the integral in the second of 
equations (C.9) to converge: the requirement proves to be that f^° dx |/| 2 exists, 
which requires that asymptotically |/| < |x| _1,/2 . Physicists generally don’t worry 
too much about this restriction. 

Using the second of equations (C.9) to eliminate the Fourier transform / from 
the first equation, we have 

/ °° j ;1 roo 

g J dz'e ifc <*->/(*')■ (C-10) 

Mathematicians stop here because our next step is illegal. 4 Physicists reverse the 
order of the integrations in equation (C.10) and write 

/ oo roo n 

dx'f(x')j (C.ll) 

Comparing this equation with equation (2.41) we see that the inner integral on the 
right satisfies the defining condition of the Dirac delta function, and we have 

5{x-x')= r ^-e ik{x - x '\ (C.12) 

J -00 27r 
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In classical statistical mechanics we are interested in the dynamics of a system with 
N degrees of freedom. We do not know the system’s state, which would be quan¬ 
tified by its position q = (qi,.. . ,qN) and the canonically conjugate momentum 
p. Our limited knowledge is contained in the probability density ip(q,p), which is 
defined such that the probability of the system being in the elementary phase-space 
volume dr = d'^qd^p is ip(c 1, p) dr. 

Over time q and p evolve according to Hamilton’s equations 


. dH . dH 
q= ; P = --r—, 

dp dq 

and ip evolves such that probability is conserved: 


. dip d .. , d 

0= dF + ^-^+dF 


dH_ 
dt dp 


dip 

dq 


dH 

dq 




(pVO 

dip 

dp 


(D.l) 


(D.2) 


where the second equality follows from substituting for q and p from Hamilton’s 
equations, and the last line follows from the definition of a Poisson bracket: if 


4 It is legitimate to reverse the order of integration only when the integrand is abso¬ 
lutely convergent, i.e., the integral of its absolute value is finite. This condition is clearly 
not satisfied in the case of e lkx . By breaking the proper rules we obtain an expression 
for an object, the Dirac delta function, that is not a legitimate function. However, it is 
extremely useful. 
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Box D.l: Classical operators for a single particle 

In the simplest case, our system consists of a single particle with Hamiltonian 

H = ^p 2 /m + V(x). Then the operators associated with p x , x, H and L z are 

d d 

p x =-ih{-, p x } = ~ih— ; x =-ih{-,x} = ih-^- 

H = -i h{-, H} = -ih( — • V - W • 4-) 

\m dp/ (x) 

L z = —i7i{-, L z } = -i h{-,xp y - yp x } 

. / d d d d \ 

Notice that (p 2 ) A (p) 2 • The commutators of these operators are 

— 0 , [L x , Ty] — \TiL z . (2) 

(The easiest way to derive the second of these results is to apply (D.8).) 


f ? (q, p) and G(q, p) are any two functions on phase space, the Poisson bracket 
{F, G} is defined to be 


_ dF dG dF dG 
1 ’ t-l^dqidpi dpidqA 


(D-3) 


We use the Poisson bracket to associate with F an operator F on other func¬ 
tions of phase space: we define the operator F by its action on an arbitrary function 

V>( p>q) : 

Ftp = -ih{tp,F}. (D.4) 

Here h is some constant with the dimensions of the product p • q i.e., the inverse 
of the dimensions of the Poisson bracket - and is introduced so the operator F has 
the same dimensions as the function F. The factor i is introduced to ensure that 
with the obvious definition of an inner product, the operator F is Hermitian: 


dr <p* Ftp = —i h 


\n ,n 9F 

d pd q0 — • — 
d q op 




' , N , N d(/>* dF 

d pd q-r— • -x-ip - 
aq op 


,i v ,jv d(j>* dF 

d qd p —— • —ip 
op d q 



When written in terms of the classical Hamiltonian operator H, the classical evo¬ 
lution equation (D.2) takes a familiar form 

i = Hrl>. (D.6) 

It is straightforward (if tedious) to show that Poisson brackets, like a commu¬ 
tators, satisfies the Jacobi identity (cf. 2.102) 

{A, {B, G}} + {B, (G, A}} + {G, {A, B}} = 0. (D.7) 

We use this identity to express the commutator of two such operators in terms of 
the Poisson bracket of the underlying functions: 

[A, B]i> = -h 2 ({{</>, B}, A} - {{V>, A}, B}) 

= h 2 {*p,{A,B}} (D.8) 

= i h{A, B}tp. 


where {A, B} denotes the operator associated with the function {A, B}. 

Let A{p. q) be some function on phase space. Then the rate of change of the 
value of A along a phase trajectory is 


<L4 
d t 


dA . dA 

dp P 9q 


•q = {A,H}. 


(D.9) 


Consequently A is a constant of motion if and only if 0 = {A, H}, which by (D.8) 
requires its operator to commute with the Hamiltonian operator: as in quantum 
mechanics, A is a constant of the motion if and only if [A, H] = 0. 
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It is instructive to repeat the analysis of §4.2 with the classical operators 
of a single-particle system (Box D.l). If we displace the system through a, its 
probability density becomes 


^'(x,p) = ip(x - a,p) 



V>(x,p) 


= exp 



V>(x,p) = exp 



i/>(x,p). 


(D.10) 


Thus p/h is the generator of displacements, as in quantum mechanics. Displace¬ 
ment of the system by 5a clearly increases the expectation of x by 5a, so with 
dr = d 3 x d 3 p 

(x) + 5a = j dTXi|>'(x,p) = f drx Tl — ip(x,p) + 0(5a 2 ). (D.ll) 


This equation will hold for an arbitrary probability density ip if and only if 


i Ti5ij 


/ 


dr XiPjip 


dr (pjXi)*ip = ih 


d T{Xi,Pj}lp, 


(D.12) 


where the second equality uses the fact that pj is Hermitian. Thus equation (D.ll) 
holds if and and only if the Poisson brackets {xi,pj} rather than the commutators 
[; Xi,Pj] satisfy the canonical commutation relations. This crucial difference between 
the quantum and classical cases arises from the way we calculate expectation values: 
in classical physics the quantum rule ( Q ) = (ip\Q\ip) is replaced by 


(Q) = J d N qd N pQip, (D.13) 

where (i) Q is the function not the associated operator, and (ii) ip occurs once not 
twice because it is a probability not a probability amplitude. On account of these 
differences, whereas equation (4.21) yields [ Xi,Pj\ = i TiSij, its classical analogue, 
(D.ll) yields {xi,pj} = Sij. 


Appendix E: Lie groups and Lie algebras 

A group is a set of objects that is equipped with an associative product and an 
identity member: given any two members a,b of a group Q, there is a member of 
Q, c = ab, and there is a member 1 such that la = a for any a in Q. For any a 
there must be an inverse a -1 such that aa -1 = 1. The associativity of the product 
means that for any a, b, c £ Q, (ab)c = a(bc). If the multiplication is commutative 
(ab = ba always) then the group is Abelian. 

Some groups, such as that of the rotations that turn a square into itself, have 
a finite number of (discrete) members, but others, such as the group SO(3) of all 
three-dimensional rotations have an infinite number of members. 1 Moreover, in a 
group such as SO(3) some members are almost identical to other members because 
they are rotations through almost the same angle around almost the same axis. A 
Lie group is such a continuous group. 2 

The linear transformations of vector spaces provide representations of groups, 
that is, concrete realisations of an abstract group in which each group member is 
represented by a linear transformation in such a way that the product of group 
members is faithfully represented by the compounding of transformations. We gen¬ 
erally quantify the linear transformations of an n-dimensional vector space with 
n x n matrices, so a representation of a group consists of a rule associating each 
group member a with a matrix M“ such that if ab = c, then M“ • M b = M c . 

In Chapter 4 we saw how translations and rotations of a system are mir¬ 
rored by unitary transformations 17(a) and U(a) of the system’s ket \ip). These 
unitary transformations form representations of the groups of translations and ro¬ 
tations. So group theory, and especially results relating to possible representations 
of groups, are important for quantum mechanics. In particular, quantum mechan¬ 
ics enormously extends the range of representations that are physically significant 
by introducing the usually infinite-dimensional (Hilbert) space of possible kets. By 


1 The group formed by orthogonal rotations in n-dimensional space is called SO(n). 
Each group member is most naturally represented by a nx n real orthogonal matrix with 
unit determinant. 

2 Technically, a Lie group is a group that is also a differentiable manifold. 



292 


Appendix F: The hidden symmetry of hydrogen 


contrast, in classical physics groups generally only have finite-dimensional repre¬ 
sentations. 

The structure of a Lie group is largely determined by the group members that 
lie near the identity, because by left-multiplying any member of a neighbourhood 
of the identity by a, we can map this neighbourhood into a neighbourhood of a. 
Moreover, if we repeatedly multiply the identity by a nearby group member, we 
can generate members that lie far from the identity, and if the structure of the 
group is simple, we can generate any group member. This logic becomes very 
clear when we work with a quantum-mechanical representation: a member near 
the identity of SO (3) is represented by the matrix I — ihct ■ J, where |<Sa| <C 1 and 
the result of repeatedly multiplying the identity by this member is represented by 
U{a ) = e -ia J (eq. 4.12). 

The infinitesimal generators of the group - in the case of SO (3) the angular- 
momentum operators Ji form a Lie algebra because they have the following 
properties: (i) they can be multiplied by (complex) numbers and added, and (ii) 
they are equipped with an antisymmetric product (the commutator) which (a) 
satisfies the Jacobi identity 

[A, [B, C\] + [B, [C, A]] + [C, [A, B]] = 0, (E.l) 

and (b) evaluates to a linear combination of the original generators. That is, if 
7i,...,7„ are n operators which satisfy [7,,7j] = iX^fc where Cy is a set of 

complex numbers, then the set {7;} forms the basis of an n-dimensional Lie algebra. 
In the case of SO(3), n = 3 and Cy = iey*,. 

The group SU(2) formed by 2 x 2 unitary matrices with unit determinant 
plays a big role in quantum mechanics because it is an important subgroup of the 
group of Lorentz transformations. Its Lie algebra is identical to that of SO(3), so 
here we have a manifestation of the fact that a Lie group is not entirely determined 
by its Lie algebra, although it nearly is. In fact, SU(2) is essentially an extension 
of SO(3). 

The group SU(3) of 3 x 3 unitary matrices with unit determinant is crucial 
for nuclear physics because the quantum field whose excitations we call quarks 
provides representations of SU(3). 


Appendix F: The hidden symmetry of hydrogen 


The gross structure of hydrogen is degenerate with respect to both l and m; there 
are states with different (l, m) that have the same energy. Degeneracy with respect 
to m is to be expected whenever the Hamiltonian is rotationally symmetric, but 
degeneracy with respect to l is special to the Coulomb potential and indicates the 
presence of additional symmetry. 

To uncover this symmetry, we define the Hermitian Runge-Lenz operator 
(Problem 8.15) 

M = ift(p x L-L x p) -. (F.l) 

2 47reo r 

The important algebraic properties of this operator are derived (after lengthy 
computations) in Problem 8.15. Crucially it commutes with the Hamiltonian, 
[M, 77] = 0, so each component of M is associated with a conserved quantity 
(§2.2.1) and the unitary transformations that M generates, U(G) = exp(— i6 ■ M) 
where 6 is a triple of real numbers, transform stationary states into stationary 
states of the same energy because these transformations are dynamical symmetries 
of 77 (§4.3). 

The components of M have the commutation relations 

[Mi,Mj] = —2ih 2 mH UjkLk- (F.2) 

k 


Consequently, the action of U(0) on stationary states of energy E < 0 can be 
obtained by using the generators 


M' 


1 

(2h 2 m\E\y/ 2 


M, 


which have the commutation relations 


(F.3) 


[Mi , Mj ] — i ^ ^ CijkLk j 
k 


(F.4) 
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Since M' is an (axial) vector operator, from §4.2 we know that the commutators 
between its components and the orbital-angular momentum operators are 

= i 'y tjjkMk- 

k 

From equations (F.4) and (F.5) it follows that the components of 

= |(L±M') 

commute with each other because 

[M+,M~] = i([. Li, - [Li, Mj] + [Mi, Lj] - [Mi, Mj]) 

= \ ^2 Ujk(Lk — Mk + Mk — Lk) = 0 

ijk 

I ([Li.Lj] ± [Li, M^ ± [M u L^ + [Mi, M - j]) 

3 e o k (Lk ± Mk ± Mk + Lk) = i'^2 tiikMjjr. 

k k 

Thus the components of M + and M satisfy the commutation relations we first 
encountered in §4.2 in connection with the rotation operators. These relations 
define the Lie Algebra of the group SU(2) formed by 2x2 unitary matrices with unit 
determinant - for a very brief account Lie groups and Lie algebras, see Appendix E. 

The group SO (4) formed by the orthogonal rotations of 4-dimensional vectors 
has just the Lie algebra that is formed by the operators M*, namely two sets of 
three operators that commute with each other like the angular-momentum opera¬ 
tors, with any operator from one set commuting with any in the other set. This 
coincidence of the Lie algebras establishes that the invariance group of hydrogen is 
SO(4). The manifest spherical symmetry of the atom is just the SO(3) sub-group 
of SO (4). The extra dimension of symmetry is obscure because in the classical 
limit it is not a point transformation - a mapping of spatial points into spatial 
points ■ but involves momentum in an essential way, so if (x',p') is the image of 
the phase-space point (x, p), x' depends on p as well as on x. 

The operators M[ that generate the hidden symmetries of hydrogen do not 
commute with t 2 , so a general symmetry transformation U(0) will turn an eigen¬ 
state |Z) of L 2 into a state that is not an eigenstate of L 2 (Problem 8.15). For a 
judicious choice of 6 it is possible move \l) into a different eigenstate |Z ± 1) of L 2 . 
This is almost what the ladder operators A; and Aj of §8.1 accomplish - almost 
because A; and Aj only modify the radial part of the wavefunction, and we have 
to change the angular part from Yj 71 to Y^ by hand. 

From the results above and our work with the angular-momentum operators, 
we know that a complete set of commuting operators for hydrogen will consist of 
H, one component of each of M 7 * 1 , say Mf and Mf, and the operators (AL+) 2 + 
(Mj)") 2 + (A/+) 2 and (Af“) 2 + ( M~) 2 + (Mf) 2 . So we could label the members 
of a complete set of hydrogen’s stationary states, | E, mf ,mf ,m + , m~), with their 
eigenvalues with respect to these operators. These five eigenvalues correspond to 
the five classical constants of motion of Kepler orbits. Five constraints on the six 
phase-space coordinates confine the orbit to a one-dimensional sub-manifold (an 
ellipse) of phase space. Indeed, in any fixed spherical potential, the direction of 
the angular momentum vector confines the orbit to a plane, and the magnitude 
of the angular momentum and the energy together determine the radial scale and 
eccentricity of the orbit. The fifth constant of motion, provided by the hidden 
symmetry of the inverse-square law, fixes the point of closest approach to the 
centre of attraction (the periapsis). The precession of the periapsis of Mercury’s 
orbit demonstrates that the planet moves in a force field that deviates from the 
inverse-square law. About 10% of this deviation is caused by general relativity and 
the rest arises from the gravitational fields of the other planets, predominantly that 
of Jupiter. 


while 

[Af±,M±] = 


(F.5) 

(F.6) 

(F.7) 

(F.8) 
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Appendix G: Lorentz covariant equations 


Special relativity is about determining how the numbers used by moving observers 
to characterise a given system are related to one another. All observers agree on 
some numbers such as the electric charge on a particle - these numbers are called 
Lorentz scalars. Other numbers, such as energy, belong to a set of four numbers, 
called a four-vector. If you know the values assigned by some observer to all four 
components of a four-vector, you can predict the values that any other observer 
will assign. If you do not know all four numbers, in general you cannot predict 
any of the values that a moving observer will assign. The components of every 
ordinary three-dimensional vector are associated with the spatial components of a 
four-vector, while some other number completes the set as the ‘time component’ 
of the four-vector. We use the convention that Greek indices run from 0 to 3 while 
Roman ones run from 1 to 3; the time component is component 0, followed by the 
x component, and so on. All components should have the same dimensions, so, for 
example, the energy-momentum four vector is 


(p°,p\p 2 ,P 3 ) = ( E/c,p x ,p y ,p z 


(G.l) 


The energies and momenta assigned by an observer who moves at speed v parallel 
to the x axis are found by multiplying the four-vector by a Lorentz transformation 
matrix. For example, if the primed observer is moving at speed v along the x axis, 
then she measures 


(G.2) 


( E ' /c \ 


( 7 

-/?7 

0 

°\ 

( E /°\ 

Px 


-/?7 

7 

0 

0 

Px 

Py 


0 

0 

1 

0 

Py 

V p'z / 


V 0 

0 

0 

l) 

V Pz ) 


where fi = v/c and the Lorentz factor is 7 = 1/ y/l — /3 2 . The indices on the 
four-vector p are written as superscripts because it proves helpful to have a form 
of p in which the sign of the time component is reversed. That is we define 


(po,Pi,P 2 ,P 3 ) = (~E/c,p x ,p y ,p z ), 


(G.3) 


and we write the indices on the left as subscripts to signal the difference in the time 
component. It is straightforward to verify that in the primed frame the components 
of the down vector are obtained by multiplication with a matrix that differs slightly 
from the one used to transform the up vector 


(G.4) 


The Lorentz transformation matrices that appear in equation (G.2) and (G.4) are 
inverses of one another. In index notation we write these equations 


( —E'/c\ 


( 7 

/?7 

0 

°\ 


( —E/c\ 

Px 


h 

7 

0 

0 


Px 

Py 


0 

0 

1 

0 


Py 

V Pz / 


\ 0 

0 

0 

1 ) 


\ Pz ) 


= E A1 >" 


and 


p'u = E A " 


Pe- 


(G.5) 


Notice that we sum over one down and one up index; we never sum over two down 
or two up indices. Summing over an up and down index is called contraction of 
those indices. 

The dot product of the up and down forms of a four vector yields a Lorentz 
scalar. For example 

712 

(G. 6 ) 


E +pl+p 2 y+pl = -mlc 2 


where mo is the particle’s rest mass. Observers in relative motion will typically 
assign different values to all four components of p and yet from them they will 
compute the same value of PaP^ ■ The dot product of two different four vectors 
is also a Lorentz scalar: the value of is the same in any frame. 


The time and space coordinates form a four-vector 
(x°, x 1 , x 2 , x 3 ) = ( ct,x,y,z ). 


(G.7) 


In some interval df of coordinate time, a particle’s position four-vector x increments 
by dx and the Lorentz scalar associated with dx is — c 2 times the square of the 
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proper-time interval associated with d t: 

(dr ) 2 = - = (df ) 2 - {(dz ) 2 + (dy ) 2 + (da) 2 } 

= (di ) 2 (l - J) =(dt) 2 (l-/3 2 ). 


(G. 8 ) 


The proper time dr is just the elapse of time in the particle’s instantaneous rest 
frame; it is the amount by which the hands move on a clock that is tied to the 
particle. From the last equality above it follows that dr = dt/ 7 , so moving clocks 
tick slowly. 

The four-velocity of a particle is 


dx M _ / dcf d* dy dz \ _ f dr dy dz \ 
dr \ dr ’ dr ’ dr ’ dr ) ^ \ ’ dt ’ df ’ df / 


(G.9) 


where 7 is the particle’s Lorentz factor. In a particle’s rest frame the four velocity 
points straight into the future: = (1, 0, 0, 0). In any frame 

u^vT = -c 2 . (G.10) 


The electrostatic potential 4> and the magnetic vector potential A form a four 
vector 

A 1 * = (<j>/c,A x ,A y ,A z ). (G.ll) 


Some numbers are members of a set of six numbers that must all be known 
in one frame before any of them can be predicted in an arbitrary frame. The six 
components of the electric and magnetic fields form such a set. We arrange them 
as the independent, non-vanishing components of an antisymmetric four by four 
matrix, called the Maxwell field tensor 


/ 0 

—E x /c 

Ey/C 

E z /c\ 

E x /c 

0 

B z 

S y 

Ey/C 

-B z 

0 

B x 

V E z /c 

By 

—B x 

0 / 


The electric and magnetic fields seen by a moving observer are obtained by pre- 
and post-multiplying this matrix by an appropriate Lorentz transformation matrix 
such as that appearing in equation (G.4). 

The equation of motion of a particle of rest mass mo and charge Q is 

mo^ = Qy Fx„u. (G.13) 

n-r z ' 


The time component of the four-velocity u is 7 c, and the spatial part is 7V, so, 
using our expression (G.12) for F, the spatial part of this equation of motion is 

70(E + vxB) = mo^= 7 m 0 ^+ 0 (/ 3 2 ), (G.14) 

dr dr 

which shows the familiar electrostatic and Lorentz forces in action. 

The great merit of establishing these rules is that we can state that the dy¬ 
namics of any system can be determined from equations in which both sides are 
of the same Lorentz-covariant type. That is, both sides are Lorentz scalars, or 
four-vectors, or antisymmetric matrices, or whatever. Any correct equation that 
does not conform to this pattern must be a fragment of a set of equations that do. 
Once a system’s governing equations have been written in Lorentz covariant form, 
we can instantly transform them to whatever reference frame we prefer to work in. 

Lagrangian and Hamiltonian for motion in an electromagnetic field Ev¬ 
ery particle traces a path x(t) through space-time. This path is called the particle’s 
world line. The particle’s action S[x(r)] is a functional of the world line - a 
number that depends on the whole line. In fact S^x] is given by 

S'fx] = J drs[x(r)], (G.15) 

where s is a function of x(r) and its derivatives. Quantum mechanics ensures that 
the world line taken by a particle as it moves from a given event xi to a second 
event X 2 is the one that extremises S. Thus the particle’s dynamics is determined 
by the action S, which is in turn determined by the function s[x]. 

The action S must be a Lorentz scalar since there is no obvious higher n-tuple 
into which we can fit it. The proper time r is a Lorentz scalar, so s must be a 
scalar too, and to determine the form of s we have only to ask what scalars we can 
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construct from the world-line x(r) and quantities such as A, F associated with the 
electromagnetic field in which the particle moves. 

First we note that S shouldn’t depend on our choice of origin, so only deriva¬ 
tives of x(t) such as dx/dr should occur in s, not x itself. Furthermore, the 
Euler-Lagrange equation, which is used to extract equations of motion from a La- 
grangian, involves differentiation with respect to the variable that parametrises 
position along the extremal path, in this case r. So we will get a second-order 
equation of motion, if s depends on dx/dr, but not on higher derivatives of x(r). 
Similarly, the Euler-Lagrange equation involves differentiation with respect to the 
position vector x, so if the equation of motion is to depend on F and not its deriva¬ 
tives, s should depend on A but not F. So the Lorentz scalars to consider are (i) 
|dx/dr | 2 = —c 2 and (ii) (dx/dr) ■ A we exclude |A | 2 from consideration since 
its contribution to S proves to be both gauge- and world-line dependent. So the 
simplest thing to try is 

S = J dr ^-m 0 c 2 + ■ A^ , (G.16) 


where we’ve included the rest mass mo for future convenience and Q is a constant 
scalar, which will turn out to be the charge. 

Now that we have chosen a form for s, we should find the differential equation 
of the world line that extremises S between given events xi and X 2 . Unfortunately, 
this is a non-standard problem because the elapse of proper time between these 
events depends on the world line taken between them, and the Euler-Lagrange 
equations require the integration variable to have fixed values at the end-points. 
By contrast, the coordinate time t does have fixed values at xi and X 2 , so we 
change integration variable from r to t. Since At/A t = u°/c = 7 , we obtain 


S = j dt(-m 0 cVl-v 2 /c 2 + Q^ ■ A). (G.17) 

If we are willing to restrict ourselves to non-relativistic motion, we can simplify 
equation (G.17) by expanding the square root and discarding terms smaller than 
v 2 /(?. Then we have 


S = 




+ Q^ ■ A + QcA o y 


(G.18) 


where the term with Ao arises because the vectors x and A are now three- rather 
than four-dimensional so we have to add explicitly the contributions of the time 
components to the contraction of two four-vectors. The term in the integrand 
—moc 2 yields a contribution to S that is world-line-independent and therefore 
plays no role in determining the world line. Hence we may delete this term and 
arrive at our final expression for the action of a charged, non-relativistic particle 


S = 




(G.19) 


where <j> = —Aq/c is the electrostatic potential. 

I 11 classical mechanics the Lagrangian is a function L(x, x) of position and 
velocity x = dx/dt that yields the action through 

S[(x(t)]= J AtL(x,^y (G. 20 ) 


Hence equation (G.19) states that the Lagrangian of a charged particle is 
L (x, x) = |mo|x | 2 + Qx ■ A - Qtj>. 

The classical momentum is defined to be 

dL . _ A 

p = -tt- = mox + Q A, 
ox 

and the classical Hamiltonian is defined to be 


(G.21) 

(G.22) 


H{x, p) = p ■ x — L 

p — QA 
= P- 

mo 

= Ip-QA| 2 

2 mo 


|p — QA| 2 Q p — QA a + 


2 mo 


mo 


+ Q4>. 


(G.23) 
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Appendix H: Thomas precession 


In this appendix we generalise the equation of motion of an electron’s spin (eq. 8.69) 


^ = iisxB 

d t 2mo 


(H.l) 


from the electron’s rest frame to a frame in which the electron is moving. We do 
this by writing equation (H.l) in Lorentz covariant form (Appendix G). 

The first step in upgrading equation (H.l) to Lorentz covariant form is to 
replace S and B with covariant structures. We hypothesise that the numbers S'; 
comprise the spatial components of a four vector that has vanishing time component 
in the particle’s rest frame. Thus 

(s°, s 1 , s 2 , s 3 ) = (0, S x , S y , S z ) (rest frame), (H.2) 


and we can calculate in an arbitrary frame by multiplying this equation by an 
appropriate Lorentz transformation matrix. Since in the rest frame is orthogonal 
to the particle’s four-velocity u M , the equation 

= 0 (H.3) 


holds in any frame. In equation (8.69) B clearly has to be treated as part of the 
Maxwell field tensor F IJU (eq. G.12). In the particle’s rest frame df = dr and 
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so equation (H.l) coincides with the spatial components of the covariant equation 


ds M 

dr 


gQ 

2roo 


E 7 ^"- 


(H.5) 


This cannot be the correct equation, however, because it is liable to violate the 
condition (H.3). To see this, consider the case in which the particle moves at 
constant velocity and dot equation (H.5) through by the fixed four-velocity u 11 . 
Then we obtain 

E £<«-->-gE''-'"'- <*«> 

The left side has to be zero but there is no reason why the right side should vanish. 
We can fix this problem by adding an extra term to the right side, so that 


ds M 

dr 




(H.7) 


When this equation is dotted through by u M , and equation (G.10) is used, the 
right side becomes proportional to Fy V {s IJ ‘u v +s 1 'u >1 ), which vanishes because 
F is antisymmetric in its indices while the bracket into which it is contracted is 
symmetric in the same indices. 1 

If our particle is accelerating, equation (H.7) is still incompatible with equation 
(H.3), as becomes obvious when one dots through by u M and includes a non-zero 
term du M /dr. Fortunately, this objection is easily fixed by adding a third term to 
the right side. We then have our final covariant equation of motion for s 



In the rest frame the spatial components of this covariant equation coincide with 
the equation (8.69) that we started from because u; = 0. The two new terms on 
the right side ensure that s remains perpendicular to the four-velocity u as it must 
do if it is to have vanishing time component in the rest frame. 

The last term on the right of equation (H.8) is entirely generated by the 
particle’s acceleration; it would survive even in the case g = 0 of vanishing magnetic 
moment. Thus the spin of an accelerating particle precesses regardless of torques. 
This precession is called Thomas precession . 2 


1 Here’s a proof that the contraction of tensors S and A that are respectively sym¬ 
metric and antisymmetric in their indices vanishes. S lljU A ilv = ,l ;My = 

— W,,., S VI1 A VI1 . This establishes that the sum is equal to minus itself. Zero is the only 
number that has this property. 

2 L.T. Thomas, Phil. Mag. 3, 1 (1927). For an illuminating discussion see §11.11 of 

Classical Electrodynamics by J.D. Jackson (Wiley). 
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If the particle’s acceleration is entirely due to the electromagnetic force that 
it experiences because it is charged, its equation of motion is (G.13). Using this in 
equation (H.8), we find 


ds M 

dr 


Q 

2roo 




( 9 - 2 )^^ 

Xu 



(H.9) 


For electrons, g = 2.002 and to a good approximation the extra terms we have 
added cancel and our originally conjectured equation (H.5) holds after all. We now 
specialise on the unusually simple and important case in which g = 2. 

From our equation of motion of the covariant object s we derive the equation 
of motion of the three-vector S whose components are the expectation values of 
the spin operators. We choose to work in the rest frame of the atom. By equation 
(H.2), S is related to s by a time-dependent Lorentz transformation from this frame 
to the electron’s rest frame. We align our x axis with the direction of the required 
boost, so 
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(H.10) 


The time equation implies that s° = /3s 1 , so the x equation can be written 


Sx = 7 (s X - Ps°) = 7(1 - P 2 )s 1 = -^ = - kfi 1 3 -). 


(H.ll) 


The y and z components of equation (H.10) state that the corresponding compo¬ 
nents of S and s are identical. Since s 1 is the projection of the spatial part of s 
onto the particle’s velocity v, we can summarise these results in the single equation 

S=s-^v + 0(/? 4 ) (H.12) 


as one can check by dotting through with the unit vectors in the three spatial 
directions. Differentiating with respect to proper time and discarding terms of 
order /3 2 , we find 


dS _ ds 1 / dv 
dr dr 2c 2 \dr 

Equation (H.9) implies that with g = 2 

ds _ Q_ / - 0 
dr mo 



(H.13) 

mo V c z / 

(H.14) 


where the second equality uses the relation s° = /3s 1 = (v • s)/c. We now use this 
equation and equation (G.14) to eliminate ds/dr and dv/dr from equation (H.13). 


fr = ( (v ' s)E - (E ' s)v+ 2c2s x b ) + ° (/32) 

= (S x (E x v) + 2c 2 S x B) + 0(/3 2 ). 


(H.15) 


Since we are working in the atom’s rest frame, B = 0 unless we are applying an 
external electric field. The difference between the electron’s proper time r and the 
atom’s proper time t is 0(/3 2 ), so we can replace r with t. We assume that E is 
generated by an electrostatic potential $(r) that is a function of radius only. Then 
E = — V<t> = — (d<J>/dr)r/r points in the radial direction. Using this relation in 
equation (H.15) we find 


dS 

dt 


Q 

2moc 2 


1 d£ 
r dr 


S x (r x v) + 2 c 2 S x B 


(H.16) 


When r x v is replaced by ftL/mo, we obtain equation (8.70). The factor of two 
difference between the coefficients of S in the spin-orbit and Zeeman Hamiltonians 
(8.71) and (8.72) that so puzzled the pioneers of quantum mechanics, arises be¬ 
cause the variable in equation (H.5) is not the usual spin operator but a Lorentz 
transformed version of it. The required factor of two emerges from the derivatives 
of v in equation (H.13). Hence it is a consequence of the fact that the electron’s 
rest frame is accelerating. 
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Appendix I: Matrix elements for a dipole-dipole 
interaction 

We calculate the matrix elements obtained by squeezing the hyperfine Hamiltonian 
(8.81) between states that would all be ground states if the nucleus had no magnetic 
dipole moment. We assume that these are S states, and therefore have a spherically 
symmetric spatial wavefunction ip[r). They differ only in their spins. In practice 
they will be the eigenstates | j, m) of the total angular momentum operators J 2 and 
J z that can be constructed by adding the nuclear and electron spins. We use the 
symbol s as a shorthand for j, m or whatever other eigenvalues we decide to use. 
The matrix elements are 

M SS I = (tp,s\HHFs\tp,s') = J d 3 xp(r)(s|H H Fs|s / ), (I.la) 

where 

P (t) = | ip{r)\ 2 , (I.lb) 

and for given s, s' (s|7/hfs|s') is a function of position x only. Substituting for 
Hufs from equation (8.81) we have 

M ss' = £ j d 3 xp(s| MN ■ V x {v x (^) } Is'). (1.2) 

We now use tensor notation (Appendix B) to extract the spin operators from the 
integral, finding 

Af ss / = — ^ ^■ijk^-klm{s\fJ J ^iflem\s )-f, (1.3a) 

47T .f—' 
ijklm 

where 

J "/ d3xp(r) l^? (L3b) 

The domain of integration is a large sphere centred on the origin. On evaluat¬ 
ing the derivatives of r -1 and writing the volume element d 3 x in spherical polar 
coordinates, the integral becomes 

1 = f P{r)r 2 &r J d 2 fi (.3^ - ^ . (1.4) 

We integrate over polar angles first. If j ^ l, the first term integrates to zero 
because the contribution from a region in which Xj is positive is exactly cancelled 
by a contribution from a region in which Xj is negative. When j = l, we orient our 
axes so that Xj is the 2 axis. Then the angular integral becomes 

/ dn ( 3 ^T - $-) = Jr / d6) sin 8(3 COS 2 0 - 1) = 0. (1.5) 

The vanishing of the angular integral implies that no contribution to the integral of 
equation (1.3b) comes from the entire region r > 0. However, we cannot conclude 
that the integral vanishes entirely because the coefficient of p in the radial integral 
of (1.4) is proportional to 1/r, so the radial integral is divergent for p(0) ^ 0. 

Since any contribution comes from the immediate vicinity of the origin, we 
return to our original expression but restrict the region of integration to an in¬ 
finitesimal sphere around the origin. We approximate p(r ) by p(0) and take it out 
of the integral. Then we can invoke the divergence theorem to state that 

/ r) 2 r _1 C fh '- 1 

dx ^ = / da ^^T’ ( L6) 

where we have used the fact that on the surface of a sphere of radius r the infinites¬ 
imal surface element is d 2 S = d 2 H ?-x. We now evaluate the surviving derivative of 
1/r: 

7 = —p(0) Jd 2 n^ = -fpmt, (1.7) 

where we have again exploited the fact that the integral vanishes by symmetry if 
j ^ l, and that when j = l it can be evaluated by taking Xj to be 2 . Inserting this 
value of I in equation (1.3a), we have 

M ss r ~ ^~p(9) ^ ^ €ijk£klm$jl (s|/TN*Pem |s ). (1-8) 

ijklm 


E 


tijk^klm 


Sji = E 


ijklm 


ijkm 


^ ^ €ij k tmj k 
ijkm 


Now 


tijk^-kjm 


(1.9) 
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This sum must be proportional to Sim because if i ^ m, it is impossible for both 
(ijk) and ( mjk ) to be permutations of ( xyz ) as they must be to get a non-vanishing 
contribution to the sum. We can determine the constant of proportionality by 
making a concrete choice for i = m. For example, when they are both x we have 


Cxyz^xyz T € xzyC-xzy 2. 


^ ' C-xjk^-xjk 

jk 

When these results are used in equation (1.8), we have finally 

m... = 


( 1 . 10 ) 


an) 
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In Problem 7.21 the selection rule on l is derived by calculating [L 2 , [L 2 ,®;]] and 
then squeezing the resulting equation between states ( l,mi\ and \l' ,m[). The alge¬ 
bra uses only two facts about the operators L and x, namely [Li, Xj] = i ^2 k tijkXk, 
and L ■ x = 0. Now if we substitute J for L, the first equation carries over (i.e., 
[ Ji,Xj] = i tijkXk ) but the second does not (J ■ x = S ■ x). To proceed, we define 
the operator 

X = J x x — ix. (J.l) 

Since X is a vector operator, it will satisfy the commutation relations [ Ji,Xj ] = 
i f-ijkXk, as can be verified by explicit calculation. Moreover X is perpendicular 
to J: 

J ' X = ^ ) £klm.JkJlXm i ^ ) JmXm 

klm m 


1 

2 


^ ^ €klm\Jk-> Jl\%m 




klm 


m 


(J.2) 


— 2 tklpJpXm 1 ) Jm.Xm — 0, 

klm p m 

where the last equality uses equation (1.10). We can now argue that the algebra 
of Problem 7.21 will carry over with J substituted for L and X substituted for x. 
Hence the matrix elements (jm\Xk\j'rn') satisfy the selection rule | j — j'\ = 1 . 

Now we squeeze a component of equation (J.l) between two states of well- 
defined angular momentum 


(jm\X r \j'm') = ^erst ^ {jm\J a \j" m”)(j" m"\x t \j'm!) - i{jm\x r \j'm) 


= ^2 erst(jm\J s \jm")(jm''\x t \j'm') - i{jm\x r \j'm'}, 

m" st 

(3.3) 

where the sum over j" has been reduced to the single term j" = j because [J 2 , J s ] = 
0. The left side vanishes for | j—j'\ A J- Moreover, since J-x is a scalar, it commutes 
with J 2 and we have that ( jm\J ■ x| j'm!) = 0 unless j = j', or 


^2(jrn\Jt\jrn')(jm'\xt\j'm') oc Sjp (J.4) 

m" t 


Let | j — j' | > 1, then in matrix notation we can write equations (J.3) for r = x,y 
and (J.4) as 

0 = J y z — J z y — ix 

0 = J z x — J x z — iy (J-5) 

0 = Ja;X + J y y + J z z, 

where Ja- etc are the (2j + l) x (2j + l) spin-j matrices introduced in §7.4.4 and x etc 
are the (2j + l) x (2ji / +1) arrays of matrix elements that we seek to constrain. These 
are three linear homogeneous simultaneous equations for the three unknowns x, etc. 
Unless the 3x3 matrix that has the J matrices for its elements is singular, the 
equations will only have the trivial solution x = y = z = 0. One can demonstrate 
that the matrix is non-singular by eliminating first x and then y. Multiplying the 
first equation by iJ^, and then subtracting the third, and taking the second from 
iJ z times the first, we obtain 

0 — (iJa,J y Jz)z (iJa;J z T 3y)y 

0 = (\3z3y + 3x)z - (iJ 2 - i)y. 


(J.6) 



Restrictions on scattering potentials 


301 


Eliminating y yields 


0 = {i(Jz — 1) (i J xJy — j z) — (iJxJz + Jy)(iJzJy + Jx)}z 

— { JzJxJy iJ 2 + 3 xJ y + U z 

+ 3 x3 z 3 y i(J x3 zJ X 3 y3 zJ y) 3y3x}z. 


(J.7) 


We can simplify the matrix that multiplies z by working 3 z to the front. In fact, 
using 


3 x3 z 3 y (JzJrC \3y^3z3y Jz (Jz Jx ^3y)3y \(3 z3 y iJrc)J?y 


- 3 z 3x3y SiJ^Jy + 3x3y 


(J.8) 


and 


3 x3 z3 x — JzJ, i J yJx 

2 

JyJzJy — JzJy "f iJxJ y 


J xJ zJ x + j yj zJy — Jz(J;c+Jy) ~ Jz (J-9) 


equation (J.7) simplifies to 

{iJ z (3 — J 2 — 2J 2 ) + JxJ y }z = 0. (J.10) 


The matrix multiplying z is not singular, so z = 0. Given this result, the second 
of equations (J.9) clearly implies that y = 0, which in turn implies that x = 0. 
This completes the demonstration that the matrix elements of the dipole operator 
between states of well defined angular momentum vanish unless | j — j'\ < 1. 
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The Q± operators of equation (12.3) require us to evaluate e lHt / H e ~ lH o t / h as t 
=poo. Since e ±loc is not mathematically well defined, we must check we really know 
what actually means. 

We can determine from equation (12.13) if it is possible to take the limit 
t —v =Foo in the upper limit of integration. Hence the operators will make sense 
so long as this integral converges when acting on free states. 

Let’s concentrate for a while on f!_, with |V'; 0) = T2_ |0; 0) telling us that |i p) 
and \(j>) behave the same way in the distant future. Using equation (12.13), we 
have 

• ,00 

h/>; 0 ) = O(f')|</>; 0 ) + - J &tU\t)VUo(t)\ 4 >-, 0 ). (K.l) 

To decide if the integral converges, we ask whether its modulus is finite (as it must 
be if \%j>) can be normalized). The triangle inequality |vi + V 2 I < |vi| + |V 2 1 tells 
us that the modulus of an integral is no greater than the integral of the modulus 
of its integrand, so 


dr W {t)VU 0 (t)\4>-,Q) 


< / dr Er(r)Vl7o(r)|0;O> 


(K.2) 


Since U(t) is unitary, the integrand simplifies to |Uf7o(r)|0;0)| = |V|(/>;r)| where 
|0; r) is the state of the free particle at time r. If the potential depends only on 
position, it can be written V = f d 3 x V(x)|x)(x|, and the integrand on the right of 
equation (K.2) becomes 


|U|0;r)| = (0;r|U 2 |0;r) 1/2 = 


d 3 xU 2 (x)|(x|0;r)| 2 


11/2 


(K..3) 


What does this expression mean? At any fixed time, |(x|0; r)| 2 d 3 x is the probabil¬ 
ity that we find our particle in a small volume d 3 x. Equation (K.3) instructs us to 
add up these probabilities over all space, weighted by the square of the potential - 
in other words (with the square root) we calculate the rms V (x) felt by the particle 
at time r. As time progresses, the particle moves and we repeat the process, adding 
up the rms potential all along the line of flight. 

Now 1 = (0;t|0;t) = f d 3 x |(x|</>; r)| 2 , we can be confident that for any 
given value of r the integral over x on the right of (K.3) will be finite. Since the 
integrand of the integral over r is manifestly positive, convergence of the integral 
over t requires that 


lim 

T—»■ OO 


d 3 xU 2 (x) |(x|0;r>| 2 


1/2 


< 0(t X ). 


(K.3b) 
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We began our discussion of scattering processes by claiming that the real particle 
should be free when far from the target, so it’s not surprising that we now find a 
condition which requires that the particle feels no potential at late times. 

If we neglect dispersion, |(x|0;r}| 2 is just a function of the ratio £ = x/r 
as the particle’s wavepacket moves around. Assuming that the potential varies as 
some power r~ n at large radii, we have for large r 

J d 3 x V 2 (x) |(x|0; t)| 2 ~ t 3-2 ™ y"d 3 £ V 2 (£)f(£). (K.4) 

Hence, at late times the rms potential varies as ~ T ~ n + 3 / 2 an d is certainly well 
defined for potentials that drop faster than r -5 / 2 . When dispersion is taken into 
account, we can sometimes strengthen this result to include potentials that drop 
more slowly - see Problem 12.4. 

Unfortunately, the Coulomb potential does not satisfy our condition. We will 
not let this bother us too greatly because pure Coulomb potentials never arise - if 
we move far enough away, they are always shielded by other charges. 



